GENES THAT REGULATE HEMATOPOIETIC BLOOD FORMING STEM
CELLS AND USES THEREOF




GOVERNMENTAL SUPPORT
This invention was made with the support of National Institutes of Health Grant
No. R01 CA45339-09 and American Cancer Society No. DHP-144/01. The United
States government may have certain rights to this invention.

Throughout this application, various publications are referenced by number. Full 
citations for these publications may be found listed at the end of the specification and 
preceding the Claims. The disclosures of these publications in their entireties are hereby 
incorporated by reference into this application in order to more fully describe the state of 
the art. A Sequence Listing is provided.



FIELD OF THE INVENTION 
The present invention relates to hematopoietic stem cells and the stem cell and support 
cell genes that support stem cell replication and differentiation. 


BACKGROUND OF THE INVENTION 
The adult hematopoietic system is organized as a hierarchy of cells with decreasing self- 
renewal and multilineage differentiation potential. This is accompanied by progressively 
larger numbers of more mature cells and an increasing tendency to be in active cell cycle
(Lemischka, I.R., 1992; Morrison, S.J., et al. 1995). Collectively, the properties of this
hierarchical system result in the balanced, lifelong production of at least eight distinct cell 
lineages. A population of stem cells establishes the entire hierarchy; therefore, in order to 
understand the fundamental mechanisms responsible for normal hematopoiesis it is 
ultimately necessary to understand the biology of the stem cells. 

Most information concerning the biology of stem cells has been obtained from the
mouse model. In this system, the most critical, characteristic property of the stem cell
population has been defined; that is, its ability to reconstitute a normal blood system in a 
transplanted host. A number of variations on the basic transplantation assay have been
described (Harrison, D.E. 1980; Spangrude, G.J., et al. 1995). All of these systems,
together with the appropriate donor vs. host or clonotypic markers, have rigorously
defined the most primitive stem cells and have provided a description of their
developmental properties. Perhaps the most striking characteristics of this cell population
come from retroviral "marking" studies (Lemischka, I.R. 1992). These studies clearly
show that a single stem cell clone is both necessary and sufficient, not only to sustain 
lifelong, multilineage hematopoiesis in one primary recipient but in numerous secondary 
animals. This illustrates the remarkable proliferative potential of the stem cell and 
directly demonstrates stem cell self-renewal. A major advance in mouse stem cell
biology was the development of strategies which facilitate the substantial enrichment of
stem cell activity (Bauman, J.G., et al. 1988; Spangrude, G.J., et al. 1988; Jordan, C.T., et
al. 1990). Purification procedures enabled the first direct approaches to unravel the
mechanisms responsible for the unique biological properties of the stem cell population. 
A key observation was that the phenotypically defined stem/progenitor cell population is
heterogeneous with respect to in vivo functional properties (Fleming, W.H., et al. 1993;
Li, C.L. and Johnson, G.R. 1992; Spangrude, G.J. and Johnson, G.R. 1990; Jones, R., et
al. 1990; Uchida, N., et al. 1993). In addition to the in vivo repopulating cells, other
primitive progenitor cells are often contained in a purified population (Weilbaecher, K., 
1991; Trevisan, M. and Iscove, N.N. 1995; Ogawa, M. 1993). These can be assayed in a
variety of in vitro systems. Whether all of these in vivo and in vitro activities represent
discrete cell subpopulations or whether there is a continuum of functional potential is still 
an unanswered question. Recent studies have suggested distinct physical properties for 
functionally different activities within the primitive population (Morrison, S.J., and
Weissman, I.L. 1994; Morrison, S.J., et al. 1997; Jones, R.M., et al. 1996). A second set
of observations revealed an inverse correlation between a tendency for active cell cycling
and primitive, uncommitted developmental potential in BM (Spangrude, G.J., and 
Johnson, G.R. 1990). In fetal liver a higher proportion of primitive stem cells is actively 
cycling (Fleming, W.H., et al. 1993). Moreover, it has been shown that fetal stem cells 
are more potent than adult stem cells in LTRA (Jordan, C.T., et al. 1995; Pawliuk, R., et
al. 1996). These are exciting observations because they suggest that rapid stem cell
cycling can be compatible with the maintenance of primitive in vivo activity. Very recent 
studies suggest that the adult BM stem cell compartment may in fact be cycling at a very 
slow rate (Bradford, G.B., et al. 1997). Clearly, stem cell cycle regulation is a critical
area for investigation. Stem cell purification has facilitated studies aimed at ex vivo
maintenance or expansion of the most primitive, transplantable stem cell. Most culture 
systems strongly favor a differentiation process (Van der Sluijs, J.P., et al. 1993; 
Traycoff, C.M., et al. 1996; Peters, S.O., et al. 1995; Knobel, K.M., et al. 1994).
However, several recent reports have been encouraging. It has been shown that colonies 
grown in defined cytokines can retain not only myeloid and erythroid but also lymphoid 
potentials (Ball, T.C., et al. 1995). Moreover, the short-term (2-3 weeks) maintenance of 
LTRA has been demonstrated in suspension cultures supported by IL6, IL11, together 
with c-kit ligand (KL) or flk2/flt3 ligand (FL) (Yonemura, Y.H., et al. 1997). A recent
report has shown that colonies initiated in cytokine-supplemented semisolid cultures 
retain LTRA (Trevisan, M., et al. 1996). The studies described herein have developed a 
stromal cell line supported system which quantitatively maintains LTRA for an extended 
(4-7 weeks) time (Moore, F.A., et al. 1997). 

In the human system it is clearly not possible to do the same kind of extensive
marking and transplantation assays. However, several xenograft model systems have
been developed to assess the in vivo behavior of human stem cells (Traycoff, C., et al.
1994; Turner, C., et al. 1996; Cashman, J., et al. 1997). Some of these experiments can
be done quantitatively in limiting dilution (Bhatia, M., et al. 1997). A very recent study
has demonstrated a common proviral integration site in granulocyte macrophages and T-
cells derived from beige/nude/XID mice 7-11 months after engraftment with genetically
transduced human stem cells (Nolta, J.A., et al. 1996). This important study paves the
way for precise in vivo clonal analyses. The largest amount of functional information
about human stem/progenitor cells has been obtained in vitro using a wide range of
stromal cell and cytokine supported culture systems. It is not possible herein to describe
and properly credit all of the important studies; however, several advances deserve
mention. The long-term culture-initiating cell (LTCIC) assay measures the in vitro
production of colony forming cells (CFC) after periods of at least five weeks in culture
(Sutherland, H.S., et al. 1989). The cells producing these CFC derive from a population
of cells which, at least to some extent, probably overlaps with the most primitive
compartment. The maintenance and expansion of primitive functional abilities in this
culture system has recently been documented (Petzer, A.L., et al. 1996). A variation on
this assay system, the extended LTCIC (ELTCIC) has been suggested to measure an even
more primitive cell population in BM and CB (Hao, Q.L., et al. 1996). A very exciting
prospect for the near future will be the integration of the various in vivo xenograft assays
with the in vitro LTCIC and ELTCIC systems. Some very recent efforts have suggested
that the NOD/SCID xenograft system and the LTCIC assay may measure distinct
stem/progenitor subsets (Larochelle, A.J., et al. 1997). Clearly, much more work needs
to be done; however, it may be anticipated that the ELTCIC system will provide the
"bridge" in this continuum. Collectively, and including the various strictly cytokine-
driven systems, the above studies illustrate the current possibility to accurately and
quantitatively reveal the majority (if not all) of the functional entities in the human
stem/progenitor cell hierarchy. The physical characterization and purification of human
stem/progenitor cells has proceeded along lines which are parallel to the mouse system.
Indeed, because of clinical impetus, it can be argued that they are further advanced. 
Thus, as measured in the range of assays discussed above, the consensus physical 
phenotype of the most primitive portion of the human stem/progenitor hierarchy is 
CD34+Lin-CD38- (Terstappen, L.W.M.M., et al. 1991). The CD34+Lin-CD38+ subset
contains less primitive, more committed cellular entities. Other studies have shown that,
similar to the mouse, low level expression of Thy1 (CD90) is a feature of the primitive
human stem cell (CD34+Lin-CD90+) (Baum, C.M., et al. 1992; Craig, W., et al. 1993).
Most CD90+ cells in this compartment are CD38-. Therefore, the consensus phenotype
can be described as CD34+Lin-CD90+ (Craig, W., et al. 1993). Two potential
differences with the murine system can now be highlighted. First, a very recent and
elegant study has shown that the most primitive mouse stem cell may in fact be CD34-/lo
(Osawa, M., et al. 1996). Whether this is a genuine difference or whether it reflects the
ability to perform more accurate long-term engraftment studies in the mouse remains to 
be determined. Second, it has been suggested that in the mouse, CD38 expression is a
positive indicator for primitive stem cell function in a purified population (Randall, T.D.,
et al. 1996). As in the mouse, human stem/progenitor cells have been identified and
purified from various sources. These include: adult BM (Baum, C.M., et al. 1992), CB
(DiGiusto, D.L., et al. 1996), fetal liver (Craig, W., et al. 1993) and peripheral blood
stem cells after various mobilization protocols (Murray, L., et al. 1994). Similar to the
data obtained in the murine system, comparative studies reveal that, in general, the basic
and fundamental functional properties of stem/progenitor cells are shared regardless of
the tissue source. There are, however, significant functional and physical differences.

Numerous insights into hematopoietic molecular control mechanisms have come
from gene-targeting studies in mice. Mutations in specific genes, most notably those
encoding transcription or DNA-binding proteins, have profound cell-intrinsic, global or
lineage-specific effects on hematopoietic development (Shivdasani, R.A., and Orkin, S.H.
1996; Orkin, S.H. 1996). In the latter cases, it is tempting to speculate that the phenotypes
result from defects in the commitment process. However, malfunctions in the
commitment decision to "set up" a program of differentiation are difficult to distinguish
from malfunctions in the differentiation program itself. Two gene products, AML1
(CBF2) and SCL (tal-1) appear to be necessary global regulators of hematopoiesis
(Wang, W., et al. 1996; Okuda, T., et al. 1996; Porcher, C., et al. 1996; Robb, L., et al.
1996). Whether these molecules act to specify a hematopoietic stem cell or by other
means is an open question. Interestingly, both of these molecules play roles in leukemic
transformation. A very important gain of function study documents the apparent ability
of HOXB4 to increase primitive cell numbers without significant impairment of
differentiation abilities (Sauvageau, G., et al. 1995). Together with observations that
HOXA9 is translocated in myeloid leukemia (Nakamura, T., et al. 1996; Borrow, J., et al.
1996), these studies suggest an important hematopoietic role for homeobox proteins.
Without question, the above and other studies have identified important regulators of
hematopoiesis. However, in almost all cases these regulators were first identified in other
systems. The opposite approach is to directly search for stem cell regulators in stem cells
(Graf, L., and Torok-Storb, B. 1995; Yang, Y., et al. 1996). The present invention solves
these problems.

SUMMARY OF THE INVENTION 

The human hematopoietic stem/progenitor cell population has been extensively
characterized according to physical and antigenic criteria as well as in a variety of in vitro
and in vivo assay systems. Collectively the human studies have revealed similarities to
the hierarchical stem/progenitor cell organization defined in the murine system. In spite
of significant strides in the identification of cytokines which can act on stem cells, it has
not been possible to define a system where undifferentiated expansion of the most
primitive stem cell population occurs. Similarly, it has not been possible to direct
differentiation along lineage-specific pathways. These limitations, which also apply to
the murine system, have hampered the elucidation of regulatory mechanisms which
mediate the most fundamental aspect of stem cell biology; that is, the decision to self-
renew or commit to differentiation. As a consequence, very little is known about the
molecular biology of the most primitive hematopoietic stem cell in any organism. It was
hypothesized that the regulation of primitive stem cells will be mediated at least in part by
the products of genes which are uniquely or predominantly expressed in these cells. One
precedent for an important, differentially-expressed molecule is the flk2/flt3 receptor
tyrosine kinase. There presumably are other important and differentially expressed gene
products. Therefore, it is an object of the present invention to identify these molecules
and address their functional roles. Specifically, an aspect of identification of gene
expression patterns specific to primitive human stem cells is the molecular phenotype of
the human stem cell. The present invention describes methods to define the profile of
genes specifically expressed in undifferentiated human stem/progenitor cell populations.

A primary focus of the present invention is on primitive cells isolated from normal 
bone marrow (BM) samples. The present invention further comprehends use of other
sources of stem cells, such as umbilical cord blood (CB).

The methods of the present invention combine diverse technical approaches and 
sophisticated bioinformatic analyses. 

This invention further provides methods to identify genes whose expression can 
be modulated by cytokine or stromal-dependent culture and/or by cell-cycle status. 

Another object of the present invention is to provide methods for the functional
characterization of human stem cell-specific gene products. An aspect of this invention is
a method to facilitate the functional characterization of specifically expressed gene 
products as candidate regulators of a variety of stem/progenitor cell processes. In 
particular, a provided method uses an in vitro system which approximates many
characteristic properties of normal stem cells to analyze positive and negative regulation
of proliferation, cell-cycle parameters, apoptosis and commitment. 

It is a further goal of the present invention to provide a necessary (and usually 
missing) component for stem cell gene-expression screens; that is, the ability to quickly 
assess the function of extensive panels of genes. 

It is also an object of the present invention to provide a method for the functional
identification of stem cell regulators. An aspect of this invention is a facile screening
method for "categorizing" large populations of specifically-expressed molecules 
according to their potential roles in a variety of stem/progenitor cell processes. Gain of
function as well as loss of function approaches are contemplated by the present invention.
This method does not rely on any prior knowledge of nucleotide or predicted protein 
sequence. 

Accordingly, the present invention provides an isolated nucleic acid
derived from an isolated hematopoietic stem cell, the isolated nucleic acid comprising the
following characteristics: (1) specifically expressed in the hematopoietic stem cell; and
(2) encoding a hematopoietic stem cell-specific protein.

The present invention additionally provides an isolated hematopoietic stem cell
specific protein or a portion thereof encoded by the provided nucleic acid. The present
invention further still provides a nucleic acid probe capable of specifically hybridizing
with the provided nucleic acid under standard hybridization conditions.

Also, the present invention provides an antibody capable of specifically binding to
the provided protein without substantially cross-reacting with a non-stem cell specific
protein or homologs thereof under conditions permissive to antibody binding.
Additionally, the present invention provides a cell capable of producing the provided
antibody.

In addition, the present invention provides a method for identifying the presence 
of a primitive hemopoietic stem cell in a sample comprising nucleic acids specifically 
expressed in hematopoietic stem cells. Further still, the present invention provides a
method for generating a stem cell/progenitor cell from a primitive hematopoietic cell in a 
sample. 

The present invention further provides a method for identifying the presence in a 
sample of a compound that modulates hematopoietic stem cell activity.

The present invention even further still provides a method for identifying
primitive hematopoietic stem cell-specific nucleic acids.

Also the present invention additionally provides a molecularly defined primitive 
hematopoietic stem cell. 

Yet additionally, the present invention provides a method for treating a condition
in a subject comprising administering to the subject a therapeutically effective amount of
a provided pharmaceutical composition. The present invention provides a method of
introducing an exogenous nucleic acid into a hematopoietic stem cell.

Finally, the present invention provides a method of ex vivo expansion of hematopoietic
stem cells. The expanded cells are available to receive exogenous genes, including by 
retroviral or other vectors which require a round of replication. Alternatively, the cells 
are available for transplantation either autologously or heterologously. 


BRIEF DESCRIPTION OF THE FIGURES 
FIGURE 1. Schematic "flow of information" strategy.


FIGURE 2. Mammalian stem cell system (black circle = strong hybridization signal, 
gray circle = detectable hybridization signal, and open circle = no detectable 
hybridization signal). The Smc-34 cDNA is a completely novel sequence with a 
predicted leucine zipper and several potential membrane spanning domains. 


FIGURES 3A-3B. (Right panel) The control, non-subtracted RDA cDNA population (38-)
contains β-actin sequences which are missing in the two subtracted RDA populations,
38- and 38-)38+. A differentially expressed gene (HDD-2, described below) is enriched
in the 38- RDA population and at least retained in the 38-)38+ RDA population (Figure
3A, left panel). Two, bi-directional, RDA cDNA populations (38- and the converse 34-)38)
were used to probe (See Figure 3) duplicate arrays of a subtracted 38- library (Figure
3B).

FIGURES 4A-4F. Figure 4A: 34B4 (SEQ.ID.No.: 69) is closely related to a gene
encoding TINUR. The sequence homologies and restricted expression pattern of 34A5 are
shown in Figures 4B and 4C. In Figures 4C and 4D (and also 5A, 5B, and 6B) there are
twenty-one samples of capfinder-amplified cDNA from various hematopoietic
populations. From left to right these are: four CD34+Lin- populations, three CD34+Lin-
CD90+ populations, two CD34- populations, four CD34+Lin-CD38+ (obtained from the
same BMs as the CD38- samples in lanes 1-4), two CD34+Lin- samples (obtained from
the same BMs as the CD90+ samples in lanes 5 and 6), three CD34+Lin- populations
obtained after 1, 2 or 4 days of culture and finally their three CD34+Lin-CD38+
counterparts. The 38G2 cDNA (SEQ.ID.No.: 70) is closely homologous to the
LTG9/MLLT3 gene located on 9q22 and involved in t(9;11) leukemia (Iida, S., et al.
1993) (Figure 4E).

FIGURES 5A-5B. G0S3, a fos-related gene (Heximer, S., et al. 1996) (Figure 5A) and 
(2) HLA-DR (Figure 5B).

FIGURES 6A-6D. Figure 6A. HDD-2 is about 500 bp. It contains a short open reading 
frame of 89 amino acids (SEQ.ID.No.: 71). The predicted peptide sequence is shown. 
Figure 6B. The expression profile of HDD-2 demonstrates that it is stem cell restricted. 
Figure 6C. HDD-2 hybridization to a dot blot with numerous human pA+ mRNA
samples (Clontech). HDD-2 hybridization is only visible in kidney (the other "spots" are 
background). Figure 6D. Genomic Southern blot confirmed that HDD-2 corresponds to 
a single-copy human gene. 

FIGURE 7. Immunoprecipitation analysis of protein extracts using rat IgG2b isotype
control antibody (IgG) or AA4.1 mAb. Protein extracts were prepared from D2N cells;
M2.4 cells; AA4-depleted fetal liver cells (FLAA4-); AA4-enriched fetal liver cells
(FLAA4+); AA4-depleted bone marrow cells (BMAA4-); AA4-enriched bone marrow 
cells (BMAA4+). Indicated on the right are positions of molecular weight markers. 


FIGURES 8A-8C. AA4 expression in retrovirus infected cells. Figure 8A. Flow
cytometry analysis of NIH 3T3 and EML C1 cells using PE-conjugated AA4.1 antibody
before and after infection with REBNA/AA4. Figure 8B. Immunoprecipitation of
biotin-labeled surface proteins using rat IgG2b isotype control antibody (lanes 1, 3, 5) or
AA4.1 mAb (lanes 2, 4, 6). Cellular extracts were prepared from D2N cells (lanes 1 and
2), REBNA/AA4 infected NIH 3T3 fibroblasts (lanes 3 and 4), and REBNA/AA4
infected EML C1 cells (lanes 5 and 6). Figure 8C. Immunoprecipitation of cellular
extracts using AA4.1. REBNA/AA4 NIH 3T3 cells (lanes 2 - 4) and REBNA/AA4 EML
C1 cells (lanes 6 - 8) were labeled with 35S-methionine and chased with nonradioactive
media for 10 min. (lanes 3 and 7) or 20 min. (lanes 4 and 8). REBNA/GFP NIH 3T3
(lane 1) and REBNA/GFP EML C1 cells (lane 5) are shown as controls. Indicated on the
right are positions of molecular weight markers.

FIGURE 9. Nucleotide sequence (SEQ.ID.No.: 72) and the deduced amino acid
sequence of AA4 (SEQ.ID.No.: 73). The amino acid sequence is numbered from 1 at the
initiator methionine. Underlined is the putative signal peptide in AA4. The transmembrane
domain is double underlined. Light grey line indicates C-type lectin carbohydrate
recognition domain. Dark grey lines show EGF-like repeats, while broken, grey lines
indicate calcium-binding EGF motifs.

FIGURES 10A-10B. Comparison of the primary structures of AA4 and human C1qR.
Figure 10A. Alignment of amino acid sequences of AA4 (SEQ.ID.No.: 73) and C1qR.
Figure 10B. Comparison of the domain structures of AA4 and C1qR. The proteins
contain N-terminal signal peptides (SP), a C-type lectin recognition domain, six EGF-like
domains, including three calcium-binding (cb) EGF-like domains, and a transmembrane
domain (TM).

FIGURE 11. In normal tissues and transformed cell lines, a 7kb RNA species hybridizes
with the cloned cDNA (Figures 11A and 11B). In addition to the 7kb species, poly(A)-
RNA from D2N cells contains a minor band corresponding to a 3.2 kb mRNA (Figure
11A, lane 8).

FIGURES 12A-12B. Northern blot analysis of AA4 expression in transformed cell lines
(Figure 12A) and normal mouse tissues (Figure 12B). Indicated on the left are positions
of 9.44 - 0.24 kb RNA molecular weight markers. Hybridization with D2N poly(A)+
RNA is shown after 2 hr and overnight exposures. Hybridizations with GAPDH are
shown as controls for equal loading.


FIGURES 13A-13C. RT-PCR of cDNAs prepared from murine fetal liver (A), bone
marrow-derived hematopoietic cells (B), or differentiating ES cells (C). Figure 13A.
Lanes 1 and 2, AA4- cells; lane 3, AA4+ cells; lane 4, AA4+Sca-1+c-Kit+Linlo cells;
lane 5, AA4+Sca-1-c-Kit+Linlo cells. Figure 13B. Lane 1, RholoSca-1+Thy-1loLin-
cells; lane 2, RhohiSca-1+Thy-1loLin- cells; lane 3, Sca-1+Thy-1-Lin- cells; lane
4, Lin+ cells. Figure 13C. ESO, undifferentiated ES cells; BL1 and 2, blast cell colonies;
ENT1 and 2, differentiated endothelial cells; HMT1, 2 and 3, differentiated hematopoietic
cells. The cDNA probes used for hybridization are indicated on the right.



WO 00/11168
PCT/US99/19052



FIGURE 14. AFT024 maintains in vivo repopulating stem cells. The ability of 3
different stromal cell lines to support highly purified fetal liver stem cells was studied.
Freshly purified day 14 fetal liver cells were transplanted directly (10^3 AA4.1+, lin-/lo,
Sca-1+, c-kit+ cells plus 10^6 Ly5.1 competitor BM per mouse; n=6) into Ly5.1 congenic
mice (Control). Additional stem cells from the same purification were also used to
initiate Dexter-LTC over irradiated AFT024, 2012, and 2018 stromal cell monolayers
(10^4 cells/10 cm dish). After 4 weeks of culture, 10% aliquots of each culture (10^3 stem
cell equivalents) were transplanted into Ly5.1 recipients (n=8) together with 10^6
competitor Ly5.1 BM cells. Peripheral blood cells from mice were analyzed for the
presence of Ly5.2+ cells at 5, 12, 24, and 56 weeks after transplant. Error bars represent
SEM.

FIGURE 15. Long-term culture on AFT024 maintains greater levels of repopulating
stem cell activity than short-term cytokine- or short-term AFT024-supported cultures.
The levels of stem cell activity maintained in short-term cytokine-supported and short-
term AFT024-supported cultures were compared to those maintained in long-term
AFT024 coculture. Purified fetal liver cells were cultured for 5 days in suspension with
cytokines or on an AFT024 monolayer (3000/well, 12-well tray). Additional cells from
the same purification were seeded onto AFT024 monolayers (3000/6 cm dish) and
maintained in Dexter-LTC for 5 weeks. At completion of both culture periods the cells
were harvested, mixed with Ly5.1 BM and used to transplant mice. Each mouse received
20% of each culture (600 stem cell equivalents) and 4×10^5 competitor BM cells (4
mice/culture). Peripheral blood cells from mice were analyzed for the presence of Ly5.2+
cells at 15 weeks after transplant. FL 1.0±0.57; FL/SL 0.75±0.25; FL/IL-6 1.8±0.14;
SL/IL-6 3.2±0.46; FL/IL-6/SL 1.7±0.21; AFT024 5 days 2.8±0.11; AFT024 5 weeks
32.2±7.4 * (p<0.004) Student's t-test. See Methods for culture conditions and cytokine
concentrations. FL, flk2/flt3-ligand; SL, steel factor. Error bars represent SEM.
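
The "stem cell equivalents" quoted in the legends of Figures 14 and 15 appear to follow from proportional arithmetic: the number of stem cells used to initiate a culture multiplied by the fraction of that culture transplanted per mouse. The short Python sketch below reproduces that arithmetic only. The function name `stem_cell_equivalents` and its parameters are illustrative assumptions and are not part of the specification.

```python
def stem_cell_equivalents(input_stem_cells: int, percent_of_culture: float) -> float:
    """Return the number of input stem cells represented by an aliquot.

    Assumption (the legends give numbers, not a formula): equivalents are
    the cells used to initiate the culture scaled by the aliquot percentage.
    """
    if input_stem_cells < 0:
        raise ValueError("input_stem_cells must be non-negative")
    if not 0 <= percent_of_culture <= 100:
        raise ValueError("percent_of_culture must be between 0 and 100")
    return input_stem_cells * percent_of_culture / 100


# Figure 14: 10,000 cells per dish, 10% aliquot transplanted per mouse.
fig14 = stem_cell_equivalents(10_000, 10)
# Figure 15: 3000 cells per culture, 20% of each culture per mouse.
fig15 = stem_cell_equivalents(3_000, 20)
print(fig14, fig15)
```

Under this reading, the two calls give the 1000 and 600 stem cell equivalents stated in the respective legends.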

FIGURE 16. Time course of cobblestone area formation on AFT024. The formation of
stromal dependent CA derived from purified fetal liver stem cells was studied in AFT024
cocultures. Characteristic clusters of at least 50 cells were scored as CA over 28 days of
culture. Results are expressed as the mean number of CA/1000 input stem cells from 3
separate fetal liver purifications (300-600 cells/well in 12-well trays). Error bars
represent the SEM. The frequency of CA after 28 days is approximately 1 for every 20
input stem cells.

FIGURE 17. High-proliferative potential multilineage clonogenic progenitors are
selectively expanded on AFT024. The clonogenic progenitor content of stem cells
maintained in AFT024 supported Dexter-LTC was determined. Enriched fetal liver stem
cells were seeded onto AFT024 monolayers. At various time points, an individual well
was harvested and the cells placed into semi-solid clonogenic progenitor assay (CFU-C).
Colonies were scored at 8-14 days. Colonies were designated as HPP upon reaching a
size >1 mm after 8 days. CFU numbers at days 0, 4, and 28 are averaged from 3-5
individual stem cell purifications. Error bars represent the SEM for these experiments.
Other time points are individual determinations. CFU are normalized to 1000 input stem
cells in the stromal cocultures for comparison to day 0 progenitors. *CFU-Mix (p=.01)
and *CFU-HPP-Mix (p=.001) are significantly expanded at day 28 compared to day 0
(Student's t-test).

FIGURES 18A-18B. Cobblestone area-initiating cells are expanded on AFT024.
Figure 18A. A quantitative estimate of the number of 28 day cultured stem cell
equivalents required to form a CA after replating on secondary AFT024 monolayers was
determined. Four different AFT024 cocultures from separate fetal liver purifications
were studied (A, B, C, and D). In limiting-dilution assay, the frequency of stem cell
equivalents required to form a CA in another 7 days was 1 in 4 (3.56±0.64, r^2=0.96).
Figure 18B. CA maintenance in one of the 4 above cultures was followed for an
additional 4 weeks. The frequency of stem cell equivalents maintaining CA was
determined. At 37% negative wells the frequencies were: 2 weeks 1 in 3, 3 weeks 1 in
10, 4 weeks 1 in 19, and at 5 weeks 1 in 29.
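
Reading frequencies "at 37% negative wells" is consistent with the usual single-hit Poisson treatment of limiting-dilution data, in which the fraction of negative wells is F0 = exp(-f × n) for a responder frequency f and n cells per well, so that F0 = 1/e (about 37%) when n = 1/f. The following Python sketch illustrates that relationship only. The specification does not state which fitting procedure was used, and the function names are hypothetical.

```python
import math


def frequency_per_cell(cells_per_well: float, negative_fraction: float) -> float:
    """Single-hit Poisson estimate: F0 = exp(-f * n), so f = -ln(F0) / n."""
    if cells_per_well <= 0:
        raise ValueError("cells_per_well must be positive")
    if not 0.0 < negative_fraction < 1.0:
        raise ValueError("negative_fraction must be strictly between 0 and 1")
    return -math.log(negative_fraction) / cells_per_well


def cells_per_responder(cells_per_well: float, negative_fraction: float) -> float:
    """Reciprocal frequency, i.e. the N in a "1 in N" statement."""
    return 1.0 / frequency_per_cell(cells_per_well, negative_fraction)


# Assumed example: if 1/e (about 37%) of wells are negative at a dose of
# 19 stem cell equivalents per well, the frequency is 1 in 19.
one_in = cells_per_responder(19, math.exp(-1))
print(round(one_in, 6))
```

In practice the 37% point is usually interpolated from a fit across several doses rather than read from a single dose; the sketch shows only the underlying identity.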

FIGURES 19A-19B. Dlk expression analysis in stromal cell lines. Figure 19A. (Top) A
1.6 kb dlk transcript is visualized in the parental AFT024 and 2012 cell lines and their
subclones, but not in 2018, CFC034 and BFC012. (Bottom) The same filter hybridized with
a β-actin probe. Figure 19B. RT-PCR analysis of 14 fetal liver-derived stromal cell lines
and other cell lines.

FIGURE 20. Cobblestone area formation by hematopoietic stem cells in the presence of
soluble dlk protein. Data are from 4 experiments; 2 each with adult BM cells (Sca-1+,
c-kit, linlo/-) and day 14 fetal liver cells (AA4.1+, Sca-1+, c-kit+, linlo/-). Results are expressed
as the ratio/fold-increase in CSA number for fourteen data points each (bars represent none
vs control; dlk vs none; dlk vs control) for the 4 different experiments. Error bars represent
SEM. *P= 0.01 comparing dlk vs none to none vs control, **P= 0.001 comparing dlk vs
control to none vs control (Student's t-test).

FIGURE 21. Membrane bound dlk expression in transfected BFC012 cells. Full length dlk
cDNA was transfected into BFC012 cells. (Left) A flow diagram of dlk expression in
transfected BFC012 populations (BFC-dlk) and cells transfected with the selection plasmid
alone (BFC-Zeo). (Right) Expression of dlk in a cloned line (BFC-dlk-5) derived from the
expressing population and a control clone (BFC-Zeo-1).


FIGURES 22A-22B. CSA formation by hematopoietic stem cells in the presence of membrane-
bound dlk. Figure 22A. Bars labeled BFC are from 5 groups (nontransfected BFC012 cells,
two control pZeo transfected BFC012 populations, and two clones derived from the pZeo
transfected populations). Bars labeled BFC-dlk are from three groups shown to express
transfected dlk; one dlk-transfected BFC012 population (BFC-dlk) and two individual
transfected clones (BFC-dlk-1 and BFC-dlk-5). Error bars represent SEM. ** P< 0.001
days 3, 4, and 5; *P< 0.01 days 6 and 7 (Student's t-test). Figure 22B. A clone derived
from the dlk transfected populations of BFC012 cells (BFC-dlk-5) and a clone derived from
pZeo transfected populations (BFC-Zeo-1) were used for CSA assay with purified fetal liver
stem cells. CSA/1000 input stem cells are expressed as the mean of three individual
experiments; error bars represent the SEM. **P< 0.001 at days 4, 6, and 8 (Student's t-test).

FIGURES 23A-23B. HPP multilineage clonogenic progenitors and in vivo repopulating
stem cells are maintained in short-term dlk-expressing cocultures. Figure 23A. Fetal liver
stem cells were purified as described and assayed for their progenitor content immediately
after purification and after culture on BFC-dlk-5 and BFC-Zeo-1. At day 4 the cultures
were used for clonogenic progenitor (3 experiments) and transplantation assay (2
experiments). Bars represent data from 3 experiments with day 0 cells (Fresh) and day 4
cocultured cells (BFC-dlk-5 and BFC-Zeo-1); error bars represent SEM. *P = 0.01 for total
CFU-C from fresh stem cells compared to total CFU-C from BFC-dlk-5 cocultures at day 4;
**P = 0.001 for total CFU-C from BFC-dlk-5 compared to total CFU-C from BFC-Zeo-1
(Student's t-test). Figure 23B. Analysis of in vivo repopulating ability of purified fetal
liver stem cells cocultured for 4 days on BFC-dlk-5, BFC-Zeo-1, and AFT024 monolayers.
Results are from nine individual mice in two experiments (4-5 mice in each experiment) at
10 weeks after transplantation. P = 0.05 for BFC-dlk-5 vs BFC-Zeo-1 (Student's t-test).

FIGURE 24. Genes and predicted proteins isolated from primitive stem cells by the
techniques of the present invention. (Sequence Identification Numbers are indicated on the
figure).

DETAILED DESCRIPTION

The present invention provides an isolated nucleic acid derived from an isolated
hematopoietic stem cell, the isolated nucleic acid comprising the following
characteristics: (1) specifically expressed in the hematopoietic stem cell; and (2)
encoding a hematopoietic stem cell-specific protein.

An embodiment of this invention further comprises the following characteristic:
capable of hybridizing under standard conditions with a sequence selected from the group
consisting of SEQ.ID.No.: 1, SEQ.ID.No.: 2, SEQ.ID.No.: 3, SEQ.ID.No.: 4, 
SEQ.ID.No.: 5, SEQ.ID.No.: 6, SEQ.ID.No.: 7, SEQ.ID.No.: 8, SEQ.ID.No.: 9, 
SEQ.ID.No.: 10, SEQ.ID.No.: 11, SEQ.ID.No.: 12, SEQ.ID.No.: 13, SEQ.ID.No.: 14, 
SEQ.ID.No.: 15, SEQ.ID.No.: 16, SEQ.ID.No.: 17, SEQ.ID.No.: 18, SEQ.ID.No.: 19,
SEQ.ID.No.: 20, SEQ.ID.No.: 21, SEQ.ID.No.: 22, SEQ.ID.No.: 23, SEQ.ID.No.: 24,
SEQ.ID.No.: 25, SEQ.ID.No.: 26, SEQ.ID.No.: 27, SEQ.ID.No.: 28, SEQ.ID.No.: 29, 
SEQ.ID.No.: 30, SEQ.ID.No.: 31, SEQ.ID.No.: 32, SEQ.ID.No.: 33, SEQ.ID.No.: 34, 
SEQ.ID.No.: 35, SEQ.ID.No.: 36, SEQ.ID.No.: 37, SEQ.ID.No.: 38, SEQ.ID.No.: 39, 
SEQ.ID.No.: 40, SEQ.ID.No.: 41, SEQ.ID.No.: 42, SEQ.ID.No.: 43, SEQ.ID.No.: 45,
SEQ.ID.No.: 47, SEQ.ID.No.: 49, SEQ.ID.No.: 51, SEQ.ID.No.: 53, SEQ.ID.No.: 55,
SEQ.ID.No.: 57, SEQ.ID.No.: 59, SEQ.ID.No.: 61, SEQ.ID.No.: 63, SEQ.ID.No.: 65, 
SEQ.ID.No.: 67, SEQ.ID.No.: 72 or a portion thereof. A portion thereof, in a preferred
embodiment of this invention, is the 5' end region or the 3' end region of the nucleic acid.
In another preferred embodiment, a portion thereof is at least an 8-18 nucleotide portion of
the coding region. In yet another preferred embodiment, a portion thereof is at least an 8-18
nucleotide portion of a non-coding regulatory region or a binding region such as a stem
cell-specific promoter or enhancer region. According to still another embodiment, the
provided isolated nucleic acid further comprises the characteristic of encoding a protein
capable of modulating hematopoietic stem cell activity. According to this invention,
modulating hematopoietic stem cell activity includes up-regulating, down-regulating or 
otherwise changing the activity of the hematopoietic stem cell. Such activity is 
contemplated as inducing differentiation or inhibiting differentiation of the cell.
However, directing differentiation toward one or another daughter cell type is also within
the scope of a preferred embodiment of this invention. Other preferred embodiments 
include but are not limited to modulation of transcription, translation, gene splicing, 
transport, proteolytic processing, replication, expression of cell surface markers and 
transplantation. According to still another embodiment of the present invention, the
activity is selected from the group consisting of hematopoietic stem cell differentiation
and hematopoietic stem cell replication. According to yet another embodiment of this 
invention, the protein is selected from the group consisting of a growth factor, a 
transcription factor, a splicing factor, a capping factor, a transport protein, a translation 
factor, and a replication factor. In one preferred embodiment of this invention, the
provided nucleic acid comprises the nucleotide sequence of SEQ.ID.No.: 72, an analog
thereof, or a portion thereof. According to another preferred embodiment of this 
invention, the hematopoietic stem cell is a primitive hematopoietic stem cell. In one 
embodiment of this invention, the primitive hematopoietic stem cell is selected from the 
group consisting of an umbilical cord cell, a bone marrow cell and a fetal liver cell. In a
preferred embodiment of this invention, the primitive hematopoietic stem cell is selected
from the group consisting of an AFT024 cell, a 2012 cell and a 2018 cell.

The present invention further provides a composition comprising the provided 
nucleic acid, wherein the nucleic acid comprises one selected from the group consisting 
of SEQ.ID.No.: 1, SEQ.ID.No.: 2, SEQ.ID.No.: 3, SEQ.ID.No.: 4, SEQ.ID.No.: 5,
SEQ.ID.No.: 6, SEQ.ID.No.: 7, SEQ.ID.No.: 8, SEQ.ID.No.: 9, SEQ.ID.No.: 10,
SEQ.ID.No.: 11, SEQ.ID.No.: 12, SEQ.ID.No.: 13, SEQ.ID.No.: 14, SEQ.ID.No.: 15, 
SEQ.ID.No.: 16, SEQ.ID.No.: 17, SEQ.ID.No.: 18, SEQ.ID.No.: 19, SEQ.ID.No.: 20, 
SEQ.ID.No.: 21, SEQ.ID.No.: 22, SEQ.ID.No.: 23, SEQ.ID.No.: 24, SEQ.ID.No.: 25, 



WO 00/11168 



16 



PCT/US99/19052 



SEQ.ID.No.: 26, SEQ.ID.No.: 27, SEQ.ID.No.: 28, SEQ.ID.No.: 29, SEQ.ID.No.: 30, 
SEQ.ID.No.: 31, SEQ.ID.No.: 32, SEQ.ID.No.: 33, SEQ.ID.No.: 34, SEQ.ID.No.: 35, 
SEQ.ID.No.: 36, SEQ.ID.No.: 37, SEQ.ID.No.: 38, SEQ.ID.No.: 39, SEQ.ID.No.: 40, 
SEQ.ID.No.: 41, SEQ.ID.No.: 42, SEQ.ID.No.: 43, SEQ.ID.No.: 45, SEQ.ID.No.: 47, 
SEQ.ID.No.: 49, SEQ.ID.No.: 51, SEQ.ID.No.: 53, SEQ.ID.No.: 55, SEQ.ID.No.: 57,
SEQ.ID.No.: 59, SEQ.ID.No.: 61, SEQ.ID.No.: 63, SEQ.ID.No.: 65, SEQ.ID.No.: 67,
SEQ.ID.No.: 72 or a portion thereof. According to one embodiment of this invention, the
nucleic acid is selected from the group consisting of DNA, RNA and cDNA. Another
embodiment of this invention is a vector comprising the provided nucleic acid.
According to yet another embodiment, the vector comprises viral or plasmid DNA. A
further embodiment of this invention is an expression vector comprising the provided 
nucleic acid and a regulatory element. A still further embodiment of this invention is a 
host vector system which comprises the expression vector in a suitable host. In a 
preferred embodiment of this invention, the suitable host is selected from the group
consisting of a bacterial cell, a eukaryotic cell, a mammalian cell and an insect cell.

The present invention additionally provides an isolated hematopoietic stem cell 
specific protein or a portion thereof encoded by the provided nucleic acid. According to
one embodiment of this invention, the provided protein further comprises the
characteristic of being capable of modulating hematopoietic stem cell activity. According
to this invention, modulating hematopoietic stem cell activity includes up-regulating,
down-regulating or otherwise changing the activity of the hematopoietic stem cell. Such 
activity is contemplated as inducing differentiation or inhibiting differentiation of the cell. 
However, directing differentiation toward one or another daughter cell type is also within 
the scope of a preferred embodiment of this invention. Other preferred embodiments
include but are not limited to modulation of transcription, translation, gene splicing,
transport, proteolytic processing, replication, expression of cell surface markers and 
transplantation. According to still another embodiment of the present invention, the 
activity is selected from the group consisting of hematopoietic stem cell differentiation 
and hematopoietic stem cell replication. According to yet another embodiment of this
invention, the protein is selected from the group consisting of a growth factor, a
transcription factor, a splicing factor, a capping factor, a transport protein, a translation 
factor, and a replication factor. According to one embodiment the activity is selected 
from the group consisting of hematopoietic stem cell differentiation and hematopoietic
stem cell replication. According to another embodiment, the protein is selected from the
group consisting of a growth factor, a transcription factor, a splicing factor, a capping 
factor, a transport protein, a translation factor, and a replication factor. According to still 
another embodiment, the protein has substantially the same amino acid sequence as one
selected from the group consisting of SEQ.ID.No.: 42, SEQ.ID.No.: 44, SEQ.ID.No.: 46,
SEQ.ID.No.: 48, SEQ.ID.No.: 50, SEQ.ID.No.: 52, SEQ.ID.No.: 54, SEQ.ID.No.: 56, 
SEQ.ID.No.: 58, SEQ.ID.No.: 60, SEQ.ID.No.: 62, SEQ.ID.No.: 64, SEQ.ID.No.: 66, 
SEQ.ID.No.: 68, SEQ.ID.No.: 70, SEQ.ID.No.: 71, and SEQ.ID.No.: 73. 

The present invention further still provides a nucleic acid probe capable of
specifically hybridizing with the provided nucleic acid under standard hybridization
conditions.

Also, the present invention provides an antibody capable of specifically binding to 
the provided protein without substantially cross-reacting with a non-stem cell specific 
protein or homologs thereof under conditions permissive to antibody binding. 
Additionally, the present invention provides a cell capable of producing the provided
antibody. 

In addition, the present invention provides a method for identifying the presence
of a primitive hematopoietic stem cell in a sample comprising nucleic acids specifically
expressed in hematopoietic stem cells, comprising (a) contacting the sample with the
provided antibody under conditions permissive to the formation of an antibody complex; and
(b) detecting the presence of the complex formed in step (a), the presence of a complex
formed indicating the presence of a primitive hematopoietic stem cell in the sample.
According to one embodiment of this invention, the antibody is labeled with a detectable
marker. In a preferred embodiment, the detectable marker is selected from the group
consisting of a radioactive isotope, enzyme, magnetic bead, dye, fluorescent marker and
biotin.

Further still, the present invention provides a method for generating a stem
cell/progenitor cell from a primitive hematopoietic cell in a sample comprising contacting
the sample with the provided protein. Another embodiment of this invention provides a
method for generating a stem cell/progenitor cell from a primitive hematopoietic cell in a
sample comprising contacting the sample with the provided nucleic acid. According to a
preferred embodiment, the nucleic acid is in an expression vector. According to another
preferred embodiment, the nucleic acid is introduced into the cell under conditions
permissive to the expression of the nucleic acid.

The present invention further provides a method for identifying the presence in a
sample of a compound that modulates hematopoietic stem cell activity comprising: (a)
contacting the hematopoietic stem cell with the sample; (b) determining the
hematopoietic stem cell activity; and (c) comparing the hematopoietic stem cell activity
determined in step (b) with the activity determined in the absence of the compound, an
increase or decrease in hematopoietic stem cell activity indicating the presence in the
sample of a compound that modulates hematopoietic stem cell activity. According to one
embodiment, the activity is selected from the group consisting of gene expression,
replication, differentiation, transplantation, and self regeneration. The present invention
still further provides a previously unknown compound identified by the method of this
invention.

The present invention even further still provides a method for identifying
primitive hematopoietic stem cell-specific nucleic acids, comprising: (a) creating a
primitive hematopoietic stem cell cDNA library and a non-primitive stem cell immune
cell cDNA library; and (b) subtracting the two libraries, thereby identifying primitive
stem cell specific nucleic acids. According to one embodiment, step (b) comprises (i)
contacting the nucleic acids of the stem cell and non-stem cell libraries with each other
under conditions permissive to hybridization, thereby forming hybrid complexes; (ii)
separating the hybrid complexes formed in step (i) from the nucleic acids which did not
form complexes; and (iii) isolating the nucleic acids which did not form complexes,
thereby identifying hematopoietic stem cell specific nucleic acids. In still another
embodiment, step (ii) further comprises amplification of the nucleic acids. In yet another
embodiment, step (iii) further comprises amplification of the nucleic acids which did not
form complexes. Even still another embodiment further comprises displaying the amplified
DNA on a chromatography gel. In a further embodiment, step (b) comprises differential
display of the two libraries, thereby identifying primitive stem cell specific nucleic acids.
In yet another embodiment, step (b) comprises representational difference analysis of the
two libraries, thereby identifying primitive stem cell specific nucleic acids. Yet even
another embodiment further comprises cloning the stem cell specific nucleic acid.
According to a preferred embodiment, the stem cell is selected from the group consisting
of AFT024, 2012, and 2018. The present invention further provides a nucleic acid
identified by the provided method.
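
By way of illustration only, the subtraction of step (b) may be pictured as a set difference. The following is a minimal sketch, assuming each library has been reduced to a set of clone identifiers; the names `subtract_libraries`, `stem_library` and `non_stem_library` are hypothetical, and actual subtractive hybridization acts on physical nucleic acids rather than on exact identifier matches.

```python
def subtract_libraries(stem_library, non_stem_library):
    """Return the clones found in the stem cell library but not in the other library.

    Clones shared by both libraries stand in for the hybrid complexes that are
    separated away; the clones left over stand in for the stem cell specific
    nucleic acids that are isolated.
    """
    return set(stem_library) - set(non_stem_library)


# Hypothetical clone identifiers, used only to show the bookkeeping.
stem_library = {"clone_A", "clone_B", "clone_C"}
non_stem_library = {"clone_B", "clone_D"}

specific = subtract_libraries(stem_library, non_stem_library)
print(sorted(specific))
```

In this sketch `clone_B` is removed because it occurs in both libraries, which mirrors the removal of sequences that form hybrid complexes.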

The present invention additionally provides a composition comprising the
provided compound and a carrier. An embodiment of this invention is a pharmaceutical
composition comprising the provided compound and a pharmaceutically acceptable
carrier.

The present invention also provides a primitive hematopoietic stem
cell specifically expressing one selected from the group consisting of: SEQ.ID.No.: 1,
SEQ.ID.No.: 2, SEQ.ID.No.: 3, SEQ.ID.No.: 4, SEQ.ID.No.: 5, SEQ.ID.No.: 6, 
SEQ.ID.No.: 7, SEQ.ID.No.: 8, SEQ.ID.No.: 9, SEQ.ID.No.: 10, SEQ.ID.No.: 11, 
SEQ.ID.No.: 12, SEQ.ID.No.: 13, SEQ.ID.No.: 14, SEQ.ID.No.: 15, SEQ.ID.No.: 16,
SEQ.ID.No.: 17, SEQ.ID.No.: 18, SEQ.ID.No.: 19, SEQ.ID.No.: 20, SEQ.ID.No.: 21,
SEQ.ID.No.: 22, SEQ.ID.No.: 23, SEQ.ID.No.: 24, SEQ.ID.No.: 25, SEQ.ID.No.: 26, 
SEQ.ID.No.: 27, SEQ.ID.No.: 28, SEQ.ID.No.: 29, SEQ.ID.No.: 30, SEQ.ID.No.: 31, 
SEQ.ID.No.: 32, SEQ.ID.No.: 33, SEQ.ID.No.: 34, SEQ.ID.No.: 35, SEQ.ID.No.: 36, 
SEQ.ID.No.: 37, SEQ.ID.No.: 38, SEQ.ID.No.: 39, SEQ.ID.No.: 40, SEQ.ID.No.: 41,
SEQ.ID.No.: 42, SEQ.ID.No.: 43, SEQ.ID.No.: 45, SEQ.ID.No.: 47, SEQ.ID.No.: 49,
SEQ.ID.No.: 51, SEQ.ID.No.: 53, SEQ.ID.No.: 55, SEQ.ID.No.: 57, SEQ.ID.No.: 59,
SEQ.ID.No.: 61, SEQ.ID.No.: 63, SEQ.ID.No.: 65, SEQ.ID.No.: 67, SEQ.ID.No.: 72 or a
portion thereof. An embodiment of this invention is a primitive hematopoietic stem cell 
specifically expressing a nucleic acid identified by the provided method. 

Yet additionally, the present invention provides a method for treating a condition
in a subject comprising administering to the subject a therapeutically effective amount of
the provided pharmaceutical composition. According to an embodiment of this invention, 
the condition is an immune system condition. In a further embodiment of this invention, 
the condition is leukemia. 

The present invention provides a method of introducing an exogenous nucleic acid
into a hematopoietic stem cell comprising contacting the stem cell with the provided
composition.

Finally, the present invention provides a method of ex vivo expansion of
hematopoietic stem cells comprising contacting the cell with the provided composition.
According to an embodiment of this invention, the ex vivo expanded hematopoietic stem
cells are available for therapeutic use. The expanded cells are available to receive
exogenous genes, including by retroviral or other vectors which require a round of
replication. Alternatively, the cells are available for transplantation either autologously or
heterologously.

As used herein, a sequence is "conserved" if there is substantial homology
of sequence between multiple gene species.

As used herein, the terms "hybridization" and "binding" in the context of probes,
primers and denatured DNA are used interchangeably. Probes which are hybridized or
bound to denatured DNA are aggregated to complementary sequences in the
polynucleotide. Whether or not a particular probe remains aggregated with the
polynucleotide depends on the degree of complementarity, the length of the probe, and
the stringency of the binding conditions. The higher the stringency, the higher the
degree of complementarity must be, and/or the longer the probe must be.

As used herein, the term "probe" refers to an oligonucleotide designed to be
sufficiently complementary to a sequence in a denatured nucleic acid to be probed, in
relation to its length, to be bound under selected stringency conditions.

Primers may vary in length. Preferably such primers should be sufficiently long to
hybridize to the modified RNAs in a specific and stable manner.

A semi-random primer, as the term is used herein, encompasses a class of primers
wherein a discrete portion of the primer is random while another discrete portion is
conserved, as well as primers which have nucleotide preferences at particular positions
within a sequence. For example, the discrete portion-type primer may have a
predetermined adaptor sequence at its 5' end and a random sequence at its 3' end.
Alternatively, several preferred primers have nucleotide preferences at specific positions
within the primers while other positions are random.

A degenerate primer, as the term is used herein, encompasses a cocktail or mixture
of primers wherein one or more of the possible triplet nucleotide sequences encoding an
amino acid is incorporated into the primer sequence. For example, serine may be
encoded by six separate triplet sequences (AGU, AGC, UCU, UCC, UCA, and UCG).
Thus, a "degenerate" primer may reflect the degeneracy of the nucleotide triplet code.

Alternatively, a randomized primer, as the term is used herein, encompasses a primer
wherein the nucleotide at one or more positions may be randomized in order to yield a
triplet sequence encoding an alternative or a random amino acid at the position.
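
The degenerate primer described above can be illustrated by enumerating every triplet combination that encodes a short peptide. The following is a minimal sketch under stated assumptions: the codon table is a small subset of the standard genetic code written as RNA triplets, the peptide "MSK" is an arbitrary example, and the names `CODONS` and `degenerate_primers` are hypothetical. It ignores strand orientation and the DNA form a working primer would take.

```python
from itertools import product

# Subset of the standard genetic code (RNA triplets). The serine entries are the
# six triplets listed in the text; the other entries are added for illustration.
CODONS = {
    "S": ["AGU", "AGC", "UCU", "UCC", "UCA", "UCG"],
    "M": ["AUG"],
    "W": ["UGG"],
    "K": ["AAA", "AAG"],
}


def degenerate_primers(peptide):
    """List every nucleotide sequence that encodes the given peptide."""
    choices = [CODONS[residue] for residue in peptide]
    return ["".join(triplets) for triplets in product(*choices)]


pool = degenerate_primers("MSK")
print(len(pool))   # 1 * 6 * 2 = 12 members in the mixture
print(pool[0])     # AUGAGUAAA
```

The size of the mixture is the product of the number of triplets available for each residue, which is why residues such as serine enlarge a degenerate pool quickly.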

An end region, as the term is used herein, consists of the end nucleotide and a
portion of the region including as much as half of the entire sequence. For example,
the "3' end region" or "3' region" of a primer may include the 3' half of the primer.
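
As a simple illustration of the end region definition, the sketch below returns the largest end region the definition allows, namely half of the sequence. It assumes the sequence is written 5' to 3', and the function name `end_region` is hypothetical.

```python
def end_region(sequence, end="3'"):
    """Return up to half of a 5'-to-3' sequence, taken from the named end."""
    half = len(sequence) // 2
    if end == "3'":
        return sequence[len(sequence) - half:]
    if end == "5'":
        return sequence[:half]
    raise ValueError("end must be \"5'\" or \"3'\"")


primer = "ACGTAC"
print(end_region(primer, "5'"))  # ACG
print(end_region(primer, "3'"))  # TAC
```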

A preferred method of hybridization is blot hybridization. See Sambrook et al.
1989 Molecular Cloning: A Laboratory Manual 2nd Ed. for additional details regarding
blot hybridization. Using this method, separated amplification products are transferred
onto a solid matrix, such as a filter. The probe, which is detectable, either directly or
indirectly, is hybridized to the solid matrix under appropriate conditions. The matrix is
washed to remove excess probe. Thereafter the probe which specifically hybridized to
the solid matrix can be detected.

The probe can be DNA or RNA and can be made detectable by any of the many
labeling techniques readily available and known to the skilled artisan. Such methods
include, but are not limited to, radio-labeling, digoxigenin-labeling, and biotin-labeling.
A well-known method of labeling DNA is with 32P, using DNA polymerase, Klenow enzyme
or polynucleotide kinase. In addition, there are known non-radioactive techniques for
signal amplification, including methods for attaching chemical moieties to pyrimidine and
purine rings (Dale, R.N.K. et al., 1973 Proc. Natl. Acad. Sci. USA 70:2238-42), methods
which allow detection by chemiluminescence (Barton, S.K. et al., 1992 J. Am. Chem. Soc.
114:8736-40), methods utilizing biotinylated nucleic acid probes (Johnson, T.K. et al.,
1983 Anal. Biochem. 133:125-131; Erickson, P.F. et al., 1982 J. Immunol. Methods
51:241-49; Matthaei, F.S. et al., 1986 Anal. Biochem. 157:123-28) and methods which
allow detection by fluorescence using commercially available products. Non-radioactive
labeling kits are also commercially available.

A basic description of nucleic acid amplification is provided in Mullis, U.S.
Patent No. 4,683,202, which is incorporated herein by reference. The amplification
reaction uses a template nucleic acid contained in a sample, two primer sequences and
inducing agents. The extension product of one primer, when hybridized to the second
primer, becomes a template for the production of a complementary extension product and
vice versa, and the process is repeated as often as is necessary to produce a detectable
amount of the sequence.
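
Because each extension product serves as a template in the next round, the amount of target sequence grows geometrically with the number of cycles. The sketch below shows the arithmetic only; it assumes an idealized per-cycle efficiency (1.0 meaning perfect doubling), which real reactions do not sustain, and the function name `amplified_copies` and the `efficiency` parameter are hypothetical.

```python
def amplified_copies(initial_copies, cycles, efficiency=1.0):
    """Estimate target copies after a number of amplification cycles.

    Each cycle multiplies the copy number by (1 + efficiency), so an
    efficiency of 1.0 doubles the target every cycle.
    """
    return initial_copies * (1.0 + efficiency) ** cycles


print(amplified_copies(1, 3))    # 8.0
print(amplified_copies(10, 30))  # 10737418240.0
```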

The inducing agent may be any compound or system which will function to
accomplish the synthesis of primer extension products, including enzymes. Suitable
enzymes for this purpose include, for example, E. coli DNA polymerase I, thermostable
Taq DNA polymerase, Klenow fragment of E. coli DNA polymerase I, T4 DNA
polymerase, other available DNA polymerases, reverse transcriptase and other enzymes
which will facilitate combination of the nucleotides in the proper manner to form
amplification products. The oligonucleotide primers can be synthesized by automated
instruments sold by a variety of manufacturers or can be commercially prepared based
upon the nucleic acid sequence of this invention.

The degree of hybridization depends on the degree of complementarity, the length
of the nucleic acid molecules being hybridized, and the stringency of the conditions in a
reaction mixture. Stringency conditions are affected by a variety of factors including, but
not limited to, temperature, salt concentration, concentration of the nucleic acids, length of
the nucleic acids, sequence of the nucleic acids and viscosity of the reaction mixture.
More stringent conditions require greater complementarity between the nucleic acids in
order to achieve effective hybridization.
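
As an illustration only of how length and sequence bear on hybrid stability, the sketch below applies the Wallace rule of thumb (about 2 °C for each A or T and 4 °C for each G or C). That rule is not part of this specification and gives only a rough melting temperature estimate for short oligonucleotides; the function name `wallace_tm` and the example sequences are hypothetical.

```python
def wallace_tm(oligo):
    """Rough melting temperature (deg C) of a short DNA oligonucleotide.

    Wallace rule of thumb: 2 deg C per A/T plus 4 deg C per G/C. Salt,
    probe concentration and mismatches are ignored.
    """
    bases = oligo.upper()
    at_count = bases.count("A") + bases.count("T")
    gc_count = bases.count("G") + bases.count("C")
    return 2 * at_count + 4 * gc_count


print(wallace_tm("ACGTACGTACGTACGT"))  # 48
print(wallace_tm("ATATATATATATATAT"))  # 32
```

Under this estimate a longer or more G/C-rich probe tolerates a higher wash temperature, which is one way to picture the statement that more stringent conditions require greater complementarity.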

Solid matrices are available to the skilled artisan. Solid phases useful to serve as a 
matrix for the present invention include but are not limited to polystyrene, polyethylene, 
polypropylene, polycarbonate, or any solid plastic material in the shape of test tubes, 
beads, microparticles, dip-sticks, plates or the like. Additionally matrices include, but are 
not limited to membranes, 96-well microtiter plates, test tubes and Eppendorf tubes. 
Solid phases also include glass beads, glass test tubes and any other appropriate shape 
made of glass. A functionalized solid phase such as plastic or glass which has been 
modified so that the surface carries carboxyl, amino, hydrazide, or aldehyde groups can 
also be used. In general such matrices comprise any surface wherein a ligand-binding 
agent can be attached or a surface which itself provides a ligand attachment site. 

As used herein, "pharmaceutically acceptable" refers to molecular entities and
compositions that are physiologically tolerable and do not typically produce an allergic
or similar untoward reaction, such as gastric upset, dizziness and the like, when
administered to a human. A pharmaceutically acceptable carrier encompasses any of
the standard pharmaceutically accepted carriers, such as phosphate buffered saline
solution, water emulsions such as an oil/water emulsion or a triglyceride emulsion, 
various types of wetting agents, tablets, coated tablets and capsules. Typically such 
carriers contain excipients such as starch, milk, sugar, certain types of clay, gelatin, 
stearic acid, talc, vegetable fats or oils, gums, glycols, or other known excipients. Such 
carriers may also include flavor and color additives or other ingredients. The invention 
also provides for pharmaceutical compositions together with suitable diluents, 
preservatives, solubilizers, emulsifiers and adjuvants. Other embodiments of the 
compositions of the invention incorporate particulate forms, protective coatings, 
protease inhibitors or permeation enhancers for various routes of administration, 
including but not limited to intravenous, intramuscular, parenteral, pulmonary, nasal 
and oral. 

As used herein, an "effective amount" is the amount required to achieve a
clinically significant effect. For example, a significant reduction of infection,
reduction of cell growth or reduction of tumor size is a reduction preferably of at
least 30 percent, more preferably of at least 50 percent, most preferably of at least 90
percent. Accordingly, the effective amount will vary with the subject being treated, as
well as the condition to be treated. For the purposes of this invention, the methods of
administration include, but are not limited to, administration cutaneously,
subcutaneously, intravenously, parenterally, orally, topically, or by aerosol.
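
The percentage thresholds given above for a significant reduction can be expressed as a short calculation. The sketch below is illustrative only; the function names `percent_reduction` and `preference_tier` and the example measurements are hypothetical and are not data from this specification.

```python
def percent_reduction(baseline, treated):
    """Percent reduction of a measurement relative to its baseline value."""
    return 100.0 * (baseline - treated) / baseline


def preference_tier(reduction_pct):
    """Map a percent reduction onto the 30/50/90 percent thresholds in the text."""
    if reduction_pct >= 90:
        return "most preferred"
    if reduction_pct >= 50:
        return "more preferred"
    if reduction_pct >= 30:
        return "preferred"
    return "below threshold"


reduction = percent_reduction(baseline=200.0, treated=80.0)
print(reduction)                   # 60.0
print(preference_tier(reduction))  # more preferred
```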

The present invention further contemplates therapeutic compositions useful in
practicing the therapeutic methods of this invention. A subject therapeutic composition
includes, in admixture, a pharmaceutically acceptable excipient (carrier) and, as an active
ingredient, one or more of a polypeptide analog or fragment of the provided peptide or
peptide composition, or a peptidomimetic composition thereof as described herein.
A cocktail of the provided pharmaceutical composition in various
combinations is also contemplated.

The preparation of therapeutic compositions which contain polypeptides, analogs 
or active fragments as active ingredients is well understood in the art. Typically, such 
compositions are prepared as injectables, either as liquid solutions or suspensions;
however, solid forms suitable for solution in, or suspension in, liquid prior to injection
can also be prepared. The preparation can also be emulsified. The active therapeutic
ingredient is often mixed with excipients which are pharmaceutically acceptable and
compatible with the active ingredient. Suitable excipients are, for example, water,
saline, dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if 
desired, the composition can contain minor amounts of auxiliary substances such as 
wetting or emulsifying agents, pH buffering agents which enhance the effectiveness of 
the active ingredient. 

A polypeptide, analog or active fragment can be formulated into the therapeutic
composition as neutralized pharmaceutically acceptable salt forms. Pharmaceutically
acceptable salts include the acid addition salts (formed with the free amino groups of the
polypeptide or antibody molecule) which are formed with inorganic acids such as,
for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic,
tartaric, mandelic, and the like. Salts formed from the free carboxyl groups can also be
derived from inorganic bases such as, for example, sodium, potassium, ammonium,
calcium, or ferric hydroxides, and such organic bases as isopropylamine,
trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.

The therapeutic polypeptide-, analog- or active fragment-containing 
compositions are conventionally administered intravenously, as by injection of a unit 
dose, for example. The term "unit dose" when used in reference to a therapeutic 
composition of the present invention refers to physically discrete units suitable as 
unitary dosage for humans, each unit containing a predetermined quantity of active 
material calculated to produce the desired therapeutic effect in association with the 
required diluent, i.e., carrier or vehicle.

The compositions are administered in a manner compatible with the dosage 
formulation, and in a therapeutically effective amount. The quantity to be administered 
depends on the subject to be treated, capacity of the subject's immune system to utilize 
the active ingredient, and degree of inhibition desired. Precise amounts of active 
ingredient required to be administered depend on the judgment of the practitioner and 
are peculiar to each individual. However, suitable dosages may range from about 0.1 
to 20, preferably about 0.5 to about 10, and more preferably one to several, milligrams 
of active ingredient per kilogram body weight of individual per day and depend on the 
route of administration. Suitable regimes for initial administration and booster shots are 
also variable, but are typified by an initial administration followed by repeated doses at 
one or more hour intervals by a subsequent injection or other administration. 
Alternatively, continuous intravenous infusion sufficient to maintain concentrations of
ten nanomolar to ten micromolar in the blood is contemplated.
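
As a purely illustrative aid, the weight-based ranges stated above can be turned into absolute daily amounts with a few lines of Python. This is a minimal sketch only: the function name and the 70 kg example weight are assumptions introduced here, and the result is arithmetic on the stated ranges, not dosing guidance.

```python
def daily_dose_range_mg(body_weight_kg, low_mg_per_kg=0.1, high_mg_per_kg=20.0):
    """Return (minimum, maximum) daily active ingredient in milligrams.

    Defaults are the broad range given in the text (about 0.1 to 20 mg
    per kg body weight per day). The function name is hypothetical.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


# Example for an assumed 70 kg individual.
broad_range = daily_dose_range_mg(70)                 # broad range from the text
preferred_range = daily_dose_range_mg(70, 0.5, 10.0)  # preferred range from the text
print(broad_range, preferred_range)
```

The actual amount remains a matter for the practitioner, as stated above, and also depends on the route of administration.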

As used herein, the term "synthetic amino acid" means an amino acid which is
chemically synthesized and is not one of the 20 amino acids occurring in nature.
As used herein, the term "biosynthetic amino acid" means an amino acid found in nature 
other than the 20 amino acids commonly described and understood in the art as "natural 
amino acids." 

As used herein, amino acid residues are preferred to be in the "L" isomeric 
form. However, residues in the "D" isomeric form can be substituted for any L-amino 
acid residue, as long as the desired functional property is retained by the polypeptide. 
NH2 refers to the free amino group present at the amino terminus of a polypeptide. 
COOH refers to the free carboxy group present at the carboxy terminus of a
polypeptide. Abbreviations for amino acid residues are used in keeping with standard
polypeptide nomenclature delineated in J. Biol. Chem., 243:3552-59 (1969).

It should be noted that all amino-acid residue sequences are represented herein 
by formulae whose left and right orientation is in the conventional direction of
amino-terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the
beginning or end of an amino acid residue sequence indicates a peptide bond to a further 
sequence of one or more amino-acid residues. 

Amino acids with nonpolar R groups include: Alanine, Valine, Leucine,
Isoleucine, Proline, Phenylalanine, Tryptophan and Methionine. Amino acids with
uncharged polar R groups include: Glycine, Serine, Threonine, Cysteine, Tyrosine,
Asparagine and Glutamine. Amino acids with charged polar R groups (negatively
charged at pH 6.0) include: Aspartic acid and Glutamic acid. Basic amino acids
(positively charged at pH 6.0) include: Lysine, Arginine and Histidine (at pH 6.0).
Amino acids with phenyl groups include: Phenylalanine, Tryptophan and Tyrosine.

Particularly preferred substitutions are: Lys for Arg and vice versa such that a positive
charge may be maintained; Glu for Asp and vice versa such that a negative charge may
be maintained; Ser for Thr such that a free -OH can be maintained; and Gln for Asn
such that a free NH2 can be maintained. Amino acids can be in the "D" or "L"
configuration. Use of peptidomimetics may involve the incorporation of a non-amino
acid residue with non-amide linkages at a given position.
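
The R-group classes and the particularly preferred substitutions listed above can be captured in a small lookup, sketched here in Python. The names `R_GROUP_CLASS`, `PREFERRED_SUBSTITUTIONS`, `r_group_class` and `is_preferred_substitution` are illustrative assumptions. The sketch reads "X for Y" as "Y is replaced by X", and treats only the pairs marked "vice versa" as bidirectional.

```python
# R-group classes as enumerated in the text (three-letter residue codes).
R_GROUP_CLASS = {
    "nonpolar": {"Ala", "Val", "Leu", "Ile", "Pro", "Phe", "Trp", "Met"},
    "uncharged polar": {"Gly", "Ser", "Thr", "Cys", "Tyr", "Asn", "Gln"},
    "acidic": {"Asp", "Glu"},          # negatively charged at pH 6.0
    "basic": {"Lys", "Arg", "His"},    # positively charged at pH 6.0
}

# (original residue, replacement residue) pairs named as particularly preferred.
PREFERRED_SUBSTITUTIONS = {
    ("Arg", "Lys"), ("Lys", "Arg"),    # positive charge maintained
    ("Asp", "Glu"), ("Glu", "Asp"),    # negative charge maintained
    ("Thr", "Ser"),                    # free -OH maintained
    ("Asn", "Gln"),                    # free NH2 maintained
}


def r_group_class(residue):
    """Return the R-group class name for a three-letter residue code."""
    for name, members in R_GROUP_CLASS.items():
        if residue in members:
            return name
    raise KeyError(f"unknown residue: {residue}")


def is_preferred_substitution(original, replacement):
    """True if replacing `original` with `replacement` is one of the listed pairs."""
    return (original, replacement) in PREFERRED_SUBSTITUTIONS


print(r_group_class("His"), is_preferred_substitution("Arg", "Lys"))
```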

Amino acid substitutions may also be introduced to substitute an amino acid with
a particularly preferable property. For example, a Cys may be introduced as a potential
site for disulfide bridges with another Cys. A His may be introduced as a particularly
"catalytic" site (i.e., His can act as an acid or base and is the most common amino acid
in biochemical catalysis). Pro may be introduced because of its particularly planar
structure, which induces β-turns in the protein's structure.

As used herein, "pM" means picomolar, "nM" means nanomolar, "uM" means
micromolar, "mM" means millimolar, "ul" or "µl" mean microliter, "ml" means milliliter,
"l" means liter.

The following examples are presented in order to more fully illustrate the
preferred embodiments of the invention. They should in no way be construed, however,
as limiting the broad scope of the invention. While the invention is described and
illustrated herein by references to various specific materials, procedures and examples, it is
understood that the invention is not restricted to the particular materials, combinations of
materials, and procedures selected for that purpose. Numerous variations of such details
can be implied as will be appreciated by those skilled in the art.

EXAMPLES 

EXAMPLE 1: Molecules specifically expressed in murine stem/progenitor
cells

Protein tyrosine kinases and phosphatases expressed in murine hematopoietic
stem cells have been previously identified (Matthews, W., et al. 1991; Matthews, W., et
al. 1991; Dosil, M., et al. 1996). These molecules play important roles in hematopoiesis
and development (Dosil, M., et al. 1996; Mackarehtschian, K., et al. 1995; Shalaby, F., et
al. 1995; Kabrun, N., et al. 1997). The present invention contemplates an even more
global approach in order to identify molecules specifically expressed in the murine
stem/progenitor cell hierarchy. For most of these studies purified
(AA4.1+Lin-/loSca+ckit+) fetal liver cells were used. This population is approximately 1,000-fold
enriched for long-term in vivo repopulating activity (LTRA) measured by competitive repopulation
(Ly5.1/Ly5.2 congenic system) (Moore, K.A., et al. 1997). All myeloid and lymphoid
lineages in primary and in secondary recipients are repopulated by these cells. This
degree of enrichment is comparable to the current "state of the art." Other primitive
members of the stem/progenitor cell hierarchy share this same cell surface phenotype.
These include: (1) LTCIC or cobblestone area forming cells (Ploemacher, R.E., et al.
1989; Ploemacher, R.E., et al. 1991), (2) CFU-blast progenitors, (3) HPP-CFC
progenitors (Lowry, P.A., et al. 1995) and (4) stromal-dependent B-lymphoid progenitors
(Whitlock, C.A., and Muller-Sieburg, C.E. 1990). The AA4.1+Lin-/loSca-ckit+ subset is
depleted of LTRA but contains significant in vitro progenitor activity. In contrast, no
stem/progenitor cell activity is found in the AA4.1- subset (Jordan, C.T., et al. 1990).
Short-term (5-7 days) cytokine cultures of stem cells were used to generate committed
progenitor populations at the expense of LTRA (Traycoff, C.M., et al. 1996; Peters, S.O.,
et al. 1995; Knobel, K.M., et al. 1994; Yonemura, Y., et al. 1996). In summary, several
cell populations were defined which represent the beginning, the middle and the end
points of the hematopoietic hierarchy. This sets the stage for a comparative analysis of
gene expression patterns. A goal of the present invention is to complement the physical
and functional phenotypes of stem/progenitor cells with profiles of uniquely expressed
genes. It was hypothesized that some of these gene products contribute to the unique
biological properties of primitive stem/progenitor cells and therefore are regulators of
self-renewal, proliferation, commitment and other processes.

There are a number of ways to compare gene expression profiles which are
available to one of skill in the art. It is possible to do this by exhaustive sequencing of
representative cDNA collections obtained from stem cell and mature cell sources
followed by "electronic subtraction". This approach has several drawbacks. Most
importantly, the number of sequences which must be obtained is prohibitive. For a
homogeneously pure population this number is on the order of 50,000 (based on
approximately 10,000-20,000 expressed genes in an average cell type and a statistical
calculation). In practice, even the most purified stem cell population is heterogeneous.
Stem cell enrichment values are only meaningful in relation to an unenriched standard
and cannot be converted into an absolute stem cell number. It has been documented that
cell populations with the same cell-surface phenotypes can differ in biological activity.
The unique properties of stem cells also suggest caution when extrapolating from
expressed gene numbers in other cells. In short, the extent of sequencing necessary to
ensure complete coverage of gene expression in stem cells is not possible to estimate.
Normalization procedures (Uchida, N., et al. 1995; Patanjali, S.R., et al. 1991; Soares,
M.B., et al. 1994) designed to "equalize" the mRNA abundance classes are not advisable
because they obliterate potentially important quantitative expression differences.
Additionally, a high-throughput sequencing effort is not applicable to numerous libraries.
Comparisons of gene expression in diverse sources of stem cells will provide valuable
information. An elegant technique, Serial Analysis of Gene Expression (SAGE), permits
the rapid acquisition of thousands of DNA sequence "tags" (Zhang, L., et al. 1997). This
technology was not considered herein because the size of each sequence "tag" is very
small (10 base pairs, bp). Therefore, SAGE is only informative in two extremes: exact
nucleotide matches or no matches to sequences in the databases. This limits database
comparison to the same species from which the "tags" originate. A key component of the
strategy presented herein relies on broad bioinformatic database comparisons. In
addition, even with a specific "tag" one still needs to obtain a full-length cDNA clone for
functional studies. Analysis of gene expression can be done in single progenitor cells
after these form a colony "start" (Brady, G., et al. 1990; Brady, G., and Iscove, N.N.
1993). Replating of the sibling cells in a "start" colony allows the approximation of the
lineage potential present in the starting cell. This technique suffers from several
drawbacks. First, it is limited to 3', non-coding ends, thus preventing protein database
comparisons. Second, the technique relies on cell growth, thus it is not suitable for
analysis of quiescent cells. Moreover, approaches to allow colony-formation by true stem
cells with a retention of primitive properties are in their infancy (Ball, T.C., et al. 1995;
Trevisan, M., et al. 1996). Third, the technique does not take into account stochastic
models of stem cell behavior (Ogawa, M. 1993). To accurately reveal physical and
functional properties of stem cells it is wise to analyze populations where stochastic
differences would average out. Single-cell derived cDNA populations could, however,
provide valuable "adjunct" material for more refined gene expression screens. There are
many techniques to physically identify differentially expressed genes. For these and
other reasons, most notably the technical and economic ease with which physical
pre-enrichments of cDNA libraries can be achieved, the studies described herein began with
cDNA libraries which are highly enriched in differentially expressed sequences. The
integration of individual techniques served to overcome the inherent limitations of each
technique. Three strategies were successfully employed: (1) Differential Display (DD)
(Liang, P., et al. 1994; Bauer, D., et al. 1993), (2) Representational Difference Analysis
(RDA) (Braun, B.S., et al. 1995; Hubank, M., and Schatz, D.G. 1994; Diatchenko, L., et
al. 1996) and (3) standard subtractive hybridization (Li, W.-B., et al. 1994; Harrison,
S.M., et al. 1995). The latter underlies the present invention. A key feature is that the
differentially expressed cDNAs have a high probability of being full-length. This
facilitates a rapid transition to functional studies. The two former techniques were
utilized because of the "visual" nature of DD and the ability of RDA to generate representative,
differentially expressed probe populations in a rapid manner.
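
The "statistical calculation" behind the sequencing estimate mentioned above (on the order of 50,000 sequences for roughly 10,000-20,000 expressed genes) is not spelled out in the text. One way such a figure could arise is sketched below in Python, under the loudly stated assumption that every expressed gene is equally abundant in the library; the function name and the 95% coverage target are illustrative choices, not taken from the specification.

```python
import math


def reads_for_coverage(n_genes, fraction):
    """Random reads needed so that the expected fraction of genes seen
    at least once reaches `fraction`.

    Assumes equal abundance of all genes (an idealization). Under this
    model a given gene is still unseen after n reads with probability
    (1 - 1/n_genes) ** n.
    """
    if not 0 < fraction < 1:
        raise ValueError("fraction must be strictly between 0 and 1")
    if n_genes < 2:
        raise ValueError("n_genes must be at least 2")
    return math.ceil(math.log(1 - fraction) / math.log(1 - 1 / n_genes))


for n_genes in (10_000, 20_000):
    print(n_genes, reads_for_coverage(n_genes, 0.95))
```

Under these assumptions the requirement falls in the tens of thousands of reads, the same order of magnitude as the figure quoted above. Real mRNA populations span several abundance classes, so rare transcripts would require considerably more reads, which is consistent with the caution expressed in the text.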

Murine stem cell gene expression profiles. As a first step, a series of high quality,
representative cDNA libraries was generated. The cDNA populations were directionally
cloned into the pSport-1 or pSport-2 plasmids (BRL-Gibco). The most important
libraries originate from purified stem cells. In one case, enough
AA4.1+Lin-/loSca+ckit+ cells were purified to allow construction of a non-PCR-based library using
standard methodologies. This library contains approximately 4 x 10^6 independent recombinants
(average cDNA insert size of 1-2 kilobases, kb). A second library was constructed using
a new PCR-based technology called cap-finder (Clontech) designed to yield full-length
cDNA copies. cDNAs ranging from 1 to 4 kb were commonly attained using this
technique. For cap-finder procedures the purified cells were processed into DNAse-I
digested, poly-A+ mRNA according to microscale procedures routinely in use. The
synthesis of cDNA was done with an aliquot of mRNA corresponding to approximately
20,000 cell equivalents. It has not been necessary to use less material, therefore this is
not a lower limit. A Not I restriction site was included in the 3' reverse transcriptase
primer (cap-finder, version 2) to facilitate directional cloning. An aliquot of the cDNA
was amplified for varying PCR cycle numbers, and analyzed by Southern blots
(pseudo-Northerns) for the presence of full-length copies of β-actin, GAPDH, CD18, flk2/flt3,
cdk4, CD34 and other mRNAs. Optimal cycle numbers were used to amplify the
remaining cDNA. The cDNA was cloned into the pSport-1 plasmid. The
AA4.1+Lin-/loSca+ckit+ cap-finder library contains approximately 3 x 10^6 independent recombinants. Single-pass
sequence analysis of random clones from this library indicated that 95% were full-length
(based on sequences with an exact match in Genbank). Much more extensive sequencing
of numerous clones from subtracted libraries has confirmed this. Other libraries
constructed in similar ways include two libraries from AA4.1- cells, and two libraries from
AA4.1+Lin-/loSca-ckit+ cells. Finally, a library was constructed from
AA4.1+Lin-/loSca+ckit+ cells cultured for 5-7 days in a differentiation promoting cytokine cocktail
(IL3, IL6, KL). Competitive repopulation and in vitro progenitor assays confirmed a
complete loss of LTRA with a significant retention of progenitor cells. All of the cDNA
libraries are large (> 2 x 10^6 independent recombinants) and of high quality (1-2 kb
average insert size). In summary, the most primitive, intermediate and most mature
members of the hematopoietic hierarchy were "converted" into representative panels of
expressed genes.

Subtractive Hybridization. These cDNA libraries were used in subtractive
hybridizations to enrich for differentially expressed genes. Target libraries from
AA4.1+Lin-/loSca+ckit+ cells were subtracted with an AA4.1- driver cDNA library. This
yields a population of cDNA clones which is enriched in sequences expressed in
primitive stem/progenitor cells but not in mature cell types. In practice, a single-stranded
target library is hybridized to an excess of in vitro synthesized biotinylated RNA from the
driver library as described. The opposite orientations of the cloning sites in pSport-1 and
pSport-2 ensure target and driver complementarity. The target library was enzymatically
converted to a single-stranded form using gene II protein and exonuclease III
(Gene-Trapper protocols, BRL-Gibco). Previously, this was done by infection with M13 helper
phage, often resulting in a bias for small cDNA inserts. Here, driver/target combinations
were subtracted two times in order to facilitate the removal of commonly expressed
sequences (verified by elimination of "housekeeping" genes such as β-actin and
GAPDH). Concomitant enrichment of known, differentially expressed genes is also
verified. Generally flk2/flt3 and CD34 probes were used. Both are expressed in the
AA4.1+Lin-/loSca+ckit+ subpopulation but not in the AA4.1- population. Following
subtraction, the relative number of clones is reduced by up to 200-fold. In some cases
(subtractions with AA4.1- material) the number of clones which "survive" the subtraction
is on the order of 10,000-20,000. Because individual sequences may be represented more
than one time, this does not necessarily imply that there are 10,000-20,000 differentially
expressed genes. The exact number of unique sequences (complexity) in the pool of
subtracted clones must be determined. A more thorough discussion of complexity is
found in a subsequent section. These subtracted libraries should be enriched for
sequences expressed in the primitive portion of the stem/progenitor cell hierarchy; that is
in stem cells and/or in primitive clonogenic progenitor cells. Two other subtracted
libraries, potentially enriched for sequences expressed in the most primitive stem cell but
not in clonogenic progenitors, were derived by subtracting the AA4.1+Lin-/loSca+ckit+
libraries with material from the closely related AA4.1+Lin-/loSca-ckit+ subpopulation.
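
The clone numbers quoted above are mutually consistent, as a short Python sketch shows: starting libraries of roughly 4 x 10^6 and 3 x 10^6 independent recombinants, reduced by the stated maximum of 200-fold, leave on the order of 10,000-20,000 clones. The second function illustrates why that count is only an upper bound on complexity, using a hypothetical list of sequence identifiers. Both function names are assumptions made for illustration.

```python
def clones_after_subtraction(independent_recombinants, fold_reduction):
    """Clones expected to survive a subtraction of the given fold reduction."""
    if fold_reduction <= 0:
        raise ValueError("fold reduction must be positive")
    return independent_recombinants // fold_reduction


def sample_complexity(sequence_ids):
    """Return (unique sequences, unique fraction) for a sequenced sample.

    `sequence_ids` is a hypothetical list in which clones carrying the
    same cDNA share an identifier.
    """
    unique = len(set(sequence_ids))
    return unique, unique / len(sequence_ids)


print(clones_after_subtraction(4_000_000, 200))  # standard library
print(clones_after_subtraction(3_000_000, 200))  # cap-finder library
print(sample_complexity(["a", "b", "a", "c"]))
```

Because "up to 200-fold" is a maximum, actual survivor counts may be higher than these figures, and the unique fraction in a real library has to be measured rather than assumed.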

Each subtracted library is arrayed at high density onto nylon membranes. Each
clone in the array has a unique address in microtiter plates. A density of 20,000-30,000 clones
on a 22 x 22 cm membrane is practical. Because of the inherent "noise" in any
subtraction scheme, a positive selection criterion may be imposed in order to focus on
true differentially expressed sequences. Details are presented in a subsequent section. To
analyze the subtracted libraries, a high throughput sequencing effort was employed using
three libraries. These are: 1) two AA4.1+Lin-/loSca+ckit+ cell libraries (standard and
cap-finder) subtracted extensively with AA4.1- cell material and 2) a standard
AA4.1+Lin-/loSca+ckit+ library subtracted with AA4.1+Lin-/loSca-ckit+ material. An
average of 400 bp of 5' sequence was obtained from about 1000 clones. To facilitate a
rational handling of sequence information and to focus attention on a small number of
clones for functional analysis a "flow of information" strategy was devised (see Figure
1).
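
The idea that each arrayed clone has a unique microtiter-plate address can be sketched in Python as follows. The plate format is not stated in the text; the 384-well layout (rows A-P, columns 1-24) used here is an assumption, as are the function names.

```python
import math
import string

ROWS = string.ascii_uppercase[:16]   # rows A-P of an assumed 384-well plate
COLUMNS = 24
WELLS_PER_PLATE = len(ROWS) * COLUMNS


def clone_address(index):
    """Map a 0-based clone index to a (plate number, well label) address."""
    if index < 0:
        raise ValueError("index must be non-negative")
    plate, offset = divmod(index, WELLS_PER_PLATE)
    row, column = divmod(offset, COLUMNS)
    return plate + 1, f"{ROWS[row]}{column + 1:02d}"


def plates_needed(n_clones):
    """Number of plates required to hold n_clones, one clone per well."""
    return math.ceil(n_clones / WELLS_PER_PLATE)


def spots_per_cm2(n_clones, side_cm=22.0):
    """Average spot density on a square membrane of the given side length."""
    return n_clones / (side_cm * side_cm)


print(clone_address(0), clone_address(383), clone_address(384))
print(plates_needed(20_000), plates_needed(30_000))
print(round(spots_per_cm2(30_000), 1))
```

With this assumed layout, the 20,000-30,000 clones mentioned above would occupy a few dozen plates, and the density on a 22 x 22 cm membrane would be on the order of tens of spots per square centimeter.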

A major component of this strategy is bioinformatics. This can be global
(comparisons with outside databases) and local ("in-house" analyses within an individual
library or comparison of several libraries). A relevant example of the latter is comparison
of genes from murine stem cells with genes derived from their human counterparts.
Global bioinformatic analysis provides much information. First, it establishes if a given
sequence corresponds identically or closely to an already identified gene in the mouse,
human or other mammalian species. Such homologies can be detected at the nucleotide
level. This can provide evidence of hematopoietic expression for a previously described
gene. Second, where the homologies are statistically significant but not identical, novel
members of gene/protein families can be identified. Third, a wider net can be cast over
the databases by using conceptual translation of a sequence in the homology comparisons.
The Examples provided herein illustrate the power of this approach in revealing
similarities to proteins from invertebrate organisms such as Drosophila, C. elegans, and
even yeast. In many cases these proteins have functions which have been uncovered by
the analysis of mutants. Based on protein homologies "virtual links" are drawn between
developmental regulation in invertebrates (such as in germ-line development) and in
hematopoietic stem cells. The Notch/Notch ligand pathway first defined in invertebrate
cell-fate determination and recently implicated in hematopoietic regulation is a good
example.

A number of other putative proteins which share homology with key Drosophila
proteins have been identified. Bioinformatics facilitates the recognition of peptide motifs
such as EGF-like repeats, Ig-like domains or Zn-finger modules. Fourth, because the
databases are annotated, predicted protein sequences can be assigned to cellular processes
such as signal transduction pathways or apoptosis. It is also possible to categorize clones
according to potential involvement in other mammalian stem cell systems such as the
intestine. Fifth, it is feasible to perform virtual expression studies and to construct
overlapping EST contigs which can yield virtual full-length cDNAs.
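
The "conceptual translation" step mentioned above, in which a nucleotide read is translated into candidate protein sequences before protein-level homology searching, can be illustrated with a minimal Python sketch. It uses the standard genetic code and translates only the three forward reading frames, on the assumption that directionally cloned 5' reads are in the sense orientation. The function names are illustrative, and the search tool applied to the translated sequences is not specified in the text.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence, frame=0):
    """Translate a nucleotide sequence from the given forward frame (0, 1 or 2).

    Stop codons are written as '*'; codons containing ambiguous bases as 'X'.
    """
    sequence = sequence.upper().replace("U", "T")
    codons = (sequence[i:i + 3] for i in range(frame, len(sequence) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frame_translation(sequence):
    """Return the translations of all three forward reading frames."""
    return [translate(sequence, frame) for frame in range(3)]


print(three_frame_translation("ATGGCCTAA"))
```

Each translated frame can then be compared against protein databases, which is what allows homologies to distantly related invertebrate or yeast proteins to be detected even when nucleotide-level similarity is absent.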

The following discussion summarizes the general murine findings and highlights a
panel of "interesting" subtracted clones. A number of full-length sequences have been
determined. Bioinformatic analysis summaries on a collection of 863 clones derived
from AA4.1+Lin-/loSca+ckit+ libraries subtracted with AA4.1- RNA are shown in Table
1.

The present invention encompasses numerous bioinformatic search and
comparison parameters. This is the first analysis of its kind in the hematopoietic system
and several important points emerge. First, there is a high proportion (~50%) of novel
sequences. Sixteen percent do not show homologies in any EST database queried. A
preliminary analysis of a small sequence set from an AA4.1+Lin-/loSca+ckit+ library
depleted of sequences in common with the AA4.1+Lin-/loSca-ckit+ subset has indicated
an even higher percentage of novel genes. These are higher percentages than would
emerge from a random analysis of an unsubtracted library. This was directly addressed by
performing these analyses on a similar number of sequences selected at random from one
of the mouse EST databases. Second, among the ~50% of the clones which are
significantly homologous to previously identified genes or proteins, the proportion of
homologies to "housekeeping" genes is low. This underscores the effectiveness of the
subtraction strategy. Third, an internal analysis of the database has revealed few
redundancies. This illustrates the degree of gene expression diversity between the two
endpoints of the mouse hematopoietic hierarchy, thus supporting the starting hypothesis.
A recent report has suggested that many sequences in mature blood cells are expressed in
primitive, non-committed cells (Hu, M., et al. 1997). The data indicate that many of these
are removed by the subtraction, thus uncovering a large, previously undescribed set of
genes. Fourth, the sequences with homologies to known genes or proteins can be
subdivided according to protein families and putative function. Interestingly, a large
percentage (32%) fall into the signaling protein category. Examples of these are
described herein below.

As part of a more sophisticated bioinformatics approach, automation of database
searches, information cross-referencing, and annotation was employed. An illustrative
example is automated weekly database queries with the sequence set. New,
previously-unidentified homologies were automatically noted and reported. In the mouse studies
several genes were encountered whose expression would be predicted to differ between
the AA4.1+Lin-/loSca+ckit+ and the AA4.1- populations. Both murine CD34 and
flk2/flt3 were identified in the sequenced population. This provides a good internal
control for the screening strategy. As shown in Table 2, a "short list" of identified
molecules was generated based on bioinformatics and in many cases,
expression-specificity verification. The features of some of the molecules merit discussion.
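
The automated weekly query described above amounts to comparing the hits returned this week with those already recorded and reporting only the additions. A minimal Python sketch of that comparison step follows. The data layout (a mapping from clone name to a collection of hit identifiers), the function name, and the placeholder identifiers are all assumptions made for illustration; the actual software and databases used are not described in the text.

```python
def new_homologies(previous_hits, current_hits):
    """Report database hits that appear for a clone now but were absent before.

    Both arguments map a clone name to an iterable of hit identifiers.
    Returns a dict of clone name -> sorted list of newly seen identifiers,
    omitting clones with nothing new.
    """
    report = {}
    for clone, hits in current_hits.items():
        fresh = set(hits) - set(previous_hits.get(clone, ()))
        if fresh:
            report[clone] = sorted(fresh)
    return report


# Hypothetical example: placeholder clone names and hit identifiers.
last_week = {"clone_A": ["hit_1"], "clone_B": []}
this_week = {"clone_A": ["hit_1", "hit_2"], "clone_B": [], "clone_C": ["hit_3"]}
print(new_homologies(last_week, this_week))
```

Keeping the comparison separate from the query itself means the same report logic can be reused regardless of which database or search program supplies the hits.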

The SA49P1 clone is homologous to sex comb on midleg, a member of the
Drosophila Polycomb group of zinc-finger transcriptional repressors (Bornemann, D., et
al. 1996). Polycomb proteins are regulators of homeobox (HOX) genes and maintain the
developmental stability of transcriptional states (Simon, J. 1995). There is currently
great interest in HOX gene function in the biology of hematopoietic stem cells
(Sauvageau, G., et al. 1995; Lawrence, H.J., et al. 1996). The C4-80 gene is homologous
to the Drosophila cornichon gene (Roth, S., et al. 1995). Cornichon is required during
oogenesis for the induction of follicle cells, which provide the environment that supports
oocyte development. Cornichon is a component of the Drosophila EGFR signaling
pathway (Neuman-Silverberg, F.S., and T. Schupback 1996). The LL2-12 gene is similar
to the Drosophila neurogenesis gene brainiac. Mutants defective in this extracellular gene
product show neural hyperplasia. Brainiac also plays a role in oogenesis (Goode, S., et al.
1992; Goode, S., et al. 1996). Mosaic experiments show that brainiac is required in the
developing oocyte. Brainiac involvement in the Drosophila EGFR pathway has been
suggested. The LL2-35 gene is similar to the Drosophila germ cell-less gene (Jongens,
T.A., et al. 1992). The product of this gene is required for the specification of the
Drosophila germ line. The ectopic expression of germ cell-less causes somatic cells to
adopt the characteristics of pole cells (destined for the germ line). The homologies of the
above clones to Drosophila genes were identified at the predicted amino acid level. This
underscores the utility of the bioinformatic approach. In all four cases the cDNAs
represent novel unpublished murine genes. Clearly, the involvement of three of the
Drosophila genes in the ultimate stem cell system, the germline, coupled with the
identification of homologous genes expressed in mouse hematopoietic stem/progenitor
cells gives food for thought. Four other cDNAs identified are homologous to the
Drosophila genes dishevelled and smoothened (mouse homologs are already described) as
well as to kelch and discs large (not previously described in the mouse). DDI 16 was
originally isolated in a DD comparison of AA4.1+ vs. AA4.1- cells. A cDNA clone
identified with the DDI 16 probe was sequenced; the predicted protein is homologous but
not identical to the murine beige gene product. The beige mutation in the mouse causes
bleeding, immune system disorders and a coat-color phenotype (Perou, C.M., et al. 1996;
Fukai, K., et al. 1996; Barbosa, M.D., et al. 1996). Murine beige is thought to be the
homologue of a gene responsible for the Chediak-Higashi syndrome in human. This
novel gene is likely to be the second member in the beige family; it is designated herein
as "taupe". Two of the cDNAs in Table 2 were identified in a DD comparison of
AA4.1+Lin-/loSca+ckit+ cells and cytokine differentiated, cultured cells. One focus was
molecules whose expression disappeared with the loss of stem cell activity in culture.
Cyt28 and Cyt19 fulfilled this criterion and were used to isolate full length clones which
have been completely sequenced. Cyt28 encodes a novel, seven-transmembrane domain
receptor in the same family as mouse F4/80 and human CD97 (secretin receptor
superfamily) (Hamann, J., et al. 1995; Baud, W., et al. 1995). Cyt19 is a novel putative
methyltransferase. In both of these cases protein family assignations were made after
extensive bioinformatic analyses. At least in the case of Cyt28, an antibody will be a
useful reagent because it may provide a means to further subdivide the stem/progenitor
cell population. Such antibodies are provided by the present invention. The cDNA SA61
is similar to a newly discovered molecule called p62dok. p62dok is a tyrosine
phosphorylated protein which binds to rasGAP and is likely to be a common target for
numerous tyrosine kinases including Abl and ckit (Carpino, N., et al. 1997; Yamanashi,
Y., 1997). Interestingly, p62dok is also constitutively phosphorylated in hematopoietic
progenitor cells from chronic phase CML patients (Carpino, N., et al. 1997). The
predicted protein encoded by SA61 appears to be a second member of this protein class.
Other representative molecules are listed in Table 2. These include two cDNAs
homologous to putative apoptosis regulators (SBSA56 and LL5-68), two cDNAs
homologous to genes translocated in leukemias (LL5-03 and B2-67), several homologies
to putative chromatin proteins (C4-23, C3-25 and LL2-89) and a LIM-domain encoding
cDNA (LL5-96). Most interestingly, three cDNAs (B3-77, C2-48, and LL2-76) are
homologous to genes expressed in intestinal crypts or during intestinal development.

Isolated cDNAs were hybridized to slot blots representing globally amplified
cDNAs from pools of individual progenitor and mature cells. Because the developmental
potentials of these cells have been measured (by replating of siblings) it is possible to
graphically represent the expression of a gene in various stages of the hierarchy. Three
examples are shown in Figures 2A-2C. The Smc-34 cDNA is a completely novel
sequence with a predicted leucine zipper and several potential membrane spanning
domains. These examples underscore the value of "interfacing" the above-described
approach with a single cell approach. Thus, it was demonstrated that a given gene is
expressed in at least multipotential progenitor cells and other members of the progenitor
cell hierarchy.

Human stem cell gene expression profiles. It is also a goal of the present
invention to identify genes expressed specifically in human stem cells. It is useful to use
a multi-species (mouse and human) approach to define stem/progenitor cell-specific gene
profiles. One goal is to integrate mouse and human information. However, rather than
proceeding directly to human homologs of the mouse genes already identified, an
independent human effort was initiated. The rationale for this is multi-faceted. First, it is
possible that some aspects of human and mouse stem cell biology are regulated in
different ways. Although it is likely that most regulatory pathways will be conserved, it
must be kept in mind that many properties ascribed to the most primitive stem cell
population have been rigorously proven only in the mouse. Clearly a human
lymphoid-myeloid stem/progenitor cell exists. However, the exact degree of proliferative capacity
(the ability to give rise to oligoclonal hematopoiesis) as well as the exact spectrum of
differentiation potentials of human stem cells have not yet been accurately measured.
Several potential differences between mouse and human have already been mentioned.
One additional indicator of differences may be the difficulty of gene-transfer into stem
cells from large animals (Larochelle, A., et al. 1997). This is not likely to result solely
from the quiescent status of the most primitive cells. Second, while it is usually possible
to find human homologs for individual mouse genes by manipulating hybridization
stringency, such conditions will vary for different genes. Therefore, to find human
homologs for a large pool of mouse genes (e.g., 100) may be more labor intensive and
costly than to independently determine a sequence profile of differentially expressed
genes from human stem/progenitor cells. Relationships to the murine panel can then be
determined electronically where it is easy to manipulate comparative parameters.
Clearly, for some individual mouse genes it will be of great importance to physically
identify human homologs. In some cases it may be possible to use existing human EST
databases to quickly obtain the sequence of a human homolog. Third, the availability of a
large panel of human sequences specifically expressed in stem/progenitor cells lends
itself to the application of various chip and array technologies. Such technologies will be
instrumental in identifying which subsets of human stem cell specific genes are up- or
down-regulated in the highly clonogenic stem cells from diseases such as Acute
Myelogenous Leukemia (AML) and other leukemias.- Highly purified human stem 
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cells for cDNA libraries were obtained from normal donor BM. CD34+ cells were 
obtained from the mononuclear fraction using an immuno-magnetic affinity device 
(Miltenyi MACS column) and stained with a mixture of monoclonal antibodies (mAbs) 
designed to identify both primitive and mature cell surface markers. To identify antigens 
on mature cells, a lineage (Lin) cocktail of FITC-conjugated mAbs was used. These 
mAbs were directed against CD3, CD11b, CD15, CD19, CD20, glycophorin A, and 
CD71 (CD3, CD15, CD19 and CD20 from Becton Dickinson, CD11b and glycophorin A 
from Coulter, and CD71 from Ortho Diagnostics). To identify primitive cells, mAbs were 
used which were directed against CD34 (biotin conjugated and detected via a 
streptavidin-allophycocyanin reagent; Coulter) and either CD38 (phycoerythrin 
conjugated; Becton Dickinson) or CD90 (Thy1) (phycoerythrin conjugated; Pharmingen). 
The stained cells were analyzed using a dual laser Becton Dickinson FACStar Plus flow 
cytometer. Cells of the desired phenotype were sorted into siliconized tubes. To obtain 
material from cultured cells, CD34+ enriched populations (isolated as described above) 
were first cultured in serum-free medium (IMDM, 5 mg/ml low density lipoprotein, 2 
mM glutamine, human serum albumin, insulin, and transferrin) supplemented with IL-3 
(5 ng/ml), KL (25 ng/ml), and FL (25 ng/ml). After 1-4 days of culture, the cells were 
purified as described above. Human libraries were constructed using the cap-finder 
technology, version 2. Amplified cDNAs originating from several independent BM 
donors were pooled. Representative cDNA libraries have been constructed from the 
following sources: (1) BM CD34+Lin- cells (7.1 x 10^5 independent recombinants), (2) 
BM CD34+Lin-CD38+ cells (1.9 x 10^6 independent recombinants), (3) BM CD34- cells 
(1.6 x 10^6 independent recombinants), (4) CB CD34+ cells (4.3 x 10^5 independent 
recombinants), and (5) CB CD34- cells (2.9 x 10^5 independent recombinants). The 
average cDNA insert size in all libraries is 1-2 kb. According to the comparative 
biological properties of the source material used for the mouse and human cDNA 
libraries, the following parallels can be drawn: (1) mouse AA4.1+Lin-/loSca+ckit+ ≈ 
human CD34+Lin-, (2) mouse AA4.1+Lin-/loSca-ckit+ ≈ human CD34+Lin-CD38+ and 
(3) mouse AA4.1- ≈ human CD34-. Thus, in both species, collections of clones 
representing the beginning, the middle and the end-points of the hematopoietic hierarchy 
have been generated. Using procedures described above, the human libraries were 
subtracted, and 3,000 clones were arrayed from two libraries: (1) BM CD34+Lin- subtracted 
with BM CD34+Lin-CD38+ material (referred to as: 38-)38+) and (2) BM CD34+Lin- 
subtracted with BM CD34- material (referred to as: 38-). Before initiating a sequencing 
effort, pilot studies were performed to improve the resolution of the screen. Specifically, 
in order to develop techniques which would eliminate as much non-specific "noise" as 
possible from the libraries, RDA (PCR-Select, Clontech) was used. There are several 
salient features of RDA. First, it is rapid (2-3 days) and applicable to small cell numbers 
(i.e., several thousand). It does not require the generation of cloned cDNA libraries; 
therefore, numerous comparisons can be made simultaneously. Second, it yields short 
cDNA fragments (generated by 4-cutter restriction enzymes) which are representative of 
differentially expressed mRNAs. Third, it is based on the suppression PCR technique 
which prevents overamplification of abundant messages (Diatchenko, L., et al. 1996). In 
addition, sequences expressed in both populations in an RDA comparison do not amplify 
exponentially. Differentially expressed populations of cDNA fragments are not obtained 
by physical enrichment but rather by selective PCR amplification. Fourth and most 
importantly, it simultaneously yields two populations, each representing differentially 
expressed genes in one of the two starting samples. This bi-directionality is valuable 
because, as discussed below, it can simultaneously reveal up- and down-regulated genes. 
RDA comparisons were made between CD34+Lin- and CD34+Lin-CD38+ and between 
CD34+Lin- and CD34- cDNAs. The latter comparison was verified by hybridization. As 
shown in Figure 3A (right panel), the control, non-subtracted RDA cDNA population (38-) 
contains β-actin sequences which are missing in the two subtracted RDA populations, 
38- and 38-)38+. A differentially expressed gene (HDD-2, described below) is enriched 
in the 38- RDA population and at least retained in the 38-)38+ RDA population (Figure 
3A, left panel). Two bi-directional RDA cDNA populations (38- and the converse 34-)38) 
were used to probe (see Figure 3) duplicate arrays of a subtracted 38- library (Figure 
3B). The correct RDA probe hybridizes to considerably more clones than the incorrect 
probe (compare the greater numbers of spots in the left as compared to the right panel). 
Hybridization signals for each clone are doublets due to the arraying technique. 
Hybridization signals with the incorrect probe (right panel) suggest that further 
improvements to this strategy are worth pursuing. Because the subtractions are based on 
different technologies, it was reasoned that clones which "survived" the library 
subtraction and hybridized preferentially to the "correct" RDA probe would be more 
likely to represent true differentially expressed genes. Preliminary data from the mouse 
studies had already suggested that this was indeed the case. The arrayed clones were also 
hybridized to probe populations from the entire starting non-subtracted libraries in order 
to eliminate residual cDNA clones corresponding to abundant mRNA species. 
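
The three hybridization criteria described above (preferential signal with the "correct" RDA probe, weak signal with the converse probe, and weak signal with the non-subtracted library probe) can be combined into a simple filter. The following is only a minimal sketch: the signal values, the thresholds `min_ratio` and `max_abundance`, and all names are hypothetical illustrations and are not taken from the experiments described here.

```python
from dataclasses import dataclass


@dataclass
class ArrayedClone:
    name: str
    correct_rda: float    # signal with the "correct" RDA probe (arbitrary units)
    incorrect_rda: float  # signal with the converse ("incorrect") RDA probe
    total_library: float  # signal with the non-subtracted library probe


def select_candidates(clones, min_ratio=3.0, max_abundance=0.5, floor=1e-6):
    """Return names of clones that prefer the correct RDA probe and are not abundant.

    min_ratio and max_abundance are illustrative cut-offs (assumptions).
    """
    kept = []
    for c in clones:
        # Ratio of correct to incorrect probe signal; floor avoids division by zero.
        ratio = c.correct_rda / max(c.incorrect_rda, floor)
        if ratio >= min_ratio and c.total_library <= max_abundance:
            kept.append(c.name)
    return kept


# Hypothetical example data
clones = [
    ArrayedClone("clone_1", 0.90, 0.05, 0.10),  # preferential and rare
    ArrayedClone("clone_2", 0.80, 0.70, 0.10),  # hybridizes to both RDA probes
    ArrayedClone("clone_3", 0.95, 0.02, 0.90),  # corresponds to an abundant mRNA
]
print(select_candidates(clones))
```

In this sketch only the first clone passes both criteria; in practice the cut-offs would have to be chosen empirically from the array data.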

In one analysis, a total of 77 clones (49 from the 38- subtraction and 28 from the 
38-)38+ subtraction) which "obeyed" the RDA criteria were sequenced from the 5' end. 
In order to verify expression specificity, a series of pseudo-Northern blot analyses were 
performed. cDNA populations from independent purified sources of material were 
amplified. This is of particular importance with an outbred species such as humans. In 
these experiments additional material from CD34+Lin-CD90+ (Thy1+) (three separate 
purifications) and their CD90- (Thy1-) counterpart subpopulations was also included. 
Cytokine-cultured samples were also included. These were CD34+Lin- cells and their 
CD38+ counterparts purified after 1, 2 and 4 days of culture. The collective panel of 
cDNAs also includes four independently purified CD34+Lin- populations and their 
CD34+Lin-CD38+ counterparts. All of these amplified cDNAs have been arranged on 
numerous pseudo-Northern blots. In short, this allowed us to evaluate the expression of a 
given "sequence tag" in stem cell populations isolated by two different criteria, from a 
number of independent donors and after cytokine culture. Several interesting genes 
emerged from the sequence data set. Individual clones obtained from the 38-)38+ 
subtraction are designated 38 letter/number (e.g., 38A1) while those from the 38-)34+ 
subtraction are designated 34 letter/number (e.g., 34A1). One clone (38B5) was identified 
and determined to be human flk2/flt3. A second interesting clone (34B4) is closely 
related to a gene encoding TINUR (Figure 4A). Clone 34B4 may be a novel variant of 
TINUR due to a 25 amino acid, in-frame deletion. TINUR was identified as an orphan 
member of the steroid receptor superfamily (NGFI-B/nur77 subfamily) (Okabe, T., et al. 
1995). TINUR has also been implicated in apoptosis. An additional clone (34F4) is 
highly homologous to DAP-kinase. This protein is a serine-threonine kinase which has 
been implicated in cytokine (IFN-γ)-induced apoptosis (Deiss, L., et al. 1995). Clones 
34F4 (DAP-kinase) and 34B4 (TINUR) both exhibit a stem-cell restricted expression 
pattern. Clearly, the identification of two genes whose products are implicated in 
apoptosis and whose expression is largely restricted to human stem cells is of interest. In 
addition, a cDNA (34A5) was identified which is closely related to the MLF1 gene, which 
is a translocation partner in t(3;5)(q25.1;q34) (Yoneda-Kato, N., et al. 1996). This 
translocation is associated with Myelodysplastic Syndrome (a stem cell disease) and 
AML. The sequence homologies and restricted expression pattern of 34A5 are shown in 
Figures 4B and 4C. In Figures 4C and 4D (and also 5A, 5B, and 6B) there are twenty-one 
samples of capfinder-amplified cDNA from various hematopoietic populations. 
From left to right these are: four CD34+Lin- populations, three CD34+Lin-CD90+ 
populations, two CD34- populations, four CD34+Lin-CD38+ (obtained from the same 
BMs as the CD38- samples in lanes 1-4), two CD34+Lin- samples (obtained from the 
same BMs as the CD90+ samples in lanes 5 and 6), three CD34+Lin- populations 
obtained after 1, 2 or 4 days of culture and finally their three CD34+Lin-CD38+ 
counterparts. A recent study shows that, at least with some types of AML, the disease 
can be transferred into NOD/SCID mice only by the leukemic CD34+CD38- 
subpopulation (Bonnet, D., and Dick, J.E. 1997). These data demonstrate MLF1 expression 
in normal stem cells. As shown in Figure 4D, nucleophosmin (NPM), the partner in this 
translocation, was also identified. This is an example of non-specifically expressed 
"noise" in the screens. The 38G2 cDNA is closely homologous to the LTG9/MLLT3 
gene located on 9q22 and involved in t(9;11) leukemia (Iida, S., et al. 1993) (Figure 4E). 
A degree of stem cell expression specificity has also been observed. All of the above 
cDNA clones are likely to be identical to the homologous, previously identified genes. 
However, the suggested involvement of DAP-kinase and TINUR in apoptosis necessitates 
their inclusion in any comprehensive consideration of stem cell apoptotic pathways. 
Similarly, the expression of two genes associated with myeloid leukemias bears on any 
speculation regarding the primary transformation target cell as well as the origins of the 
ultimate clinical phenotype in these and other leukemic disorders. 
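
The clone designations used above (a "38" or "34" series prefix followed by a letter and a number, as in 38B5 or 34F4) can be handled programmatically when tabulating results. The sketch below is only an illustration under that stated convention; the function name and the returned field names are hypothetical, and designations that follow a different scheme (such as HDD-2) are deliberately rejected.

```python
import re

# Series prefix (34 or 38), one letter, then a one- or two-digit number.
# Optional whitespace after the prefix tolerates spellings such as "34 A5".
CLONE_RE = re.compile(r"^(34|38)\s*([A-Z])(\d{1,2})$")


def parse_clone(designation):
    """Split a designation such as '38A11' into series, letter and number."""
    match = CLONE_RE.match(designation.strip())
    if match is None:
        raise ValueError(f"not a 34/38-series clone designation: {designation!r}")
    series, letter, number = match.groups()
    return {"series": series, "letter": letter, "number": int(number)}


def group_by_series(designations):
    """Group clone designations by their series prefix."""
    groups = {}
    for d in designations:
        groups.setdefault(parse_clone(d)["series"], []).append(d)
    return groups


print(group_by_series(["38B5", "34B4", "34F4", "34A5", "38G2"]))
```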

Other known genes were found that exhibit identity or very close homology, 
including: (1) G0S3, a fos-related gene (Heximer, S., et al. 1996) (Figure 5A) and (2) 
HLA-DR (Figure 5B). G0S3 shows a specific expression pattern, while HLA-DR 
expression appears to be more variable. The expression status of Class II MHC on the 
most primitive human BM stem cell population is not entirely clear. It has been 
suggested to be present in a primitive, multipotent progenitor population but not in the 
most primitive stem cells (Sutherland, H.S., et al. 1989; Verfaillie, C., et al. 1990). If 
true, this may suggest an additional negative selection parameter for future experiments 
designed to subdivide the stem/progenitor cell hierarchy. Of the first 75 sequences, 22 
have no homologies in the databases or homologies only to ESTs. Expression analyses 
on these clones are in progress. One gene of particular interest is called HDD-2, also 
designated 34B5, 34E1 and 38A11. The three latter designations reflect its isolation from 
both the 38- and the 38-)38+ subtracted libraries. The designation HDD-2 reflects its 
independent isolation in a limited differential display (DD) "first look" at molecular 
differences in the purified cell populations. The likely full-length sequence of HDD-2 is 
approximately 500 bp. It contains a 
short open reading frame of 89 amino acids (SEQ.ID.No.: 71). The predicted peptide 
sequence is shown in Figure 6A. The 3' cDNA sequence contains a poly-A tail preceded 
by the canonical AATAAA poly-adenylation signal. Neither the nucleotide nor the 
predicted protein sequences of HDD-2 show homologies in any known gene or EST 
database. The expression profile of HDD-2, shown in Figure 6B, demonstrates that it is 
stem cell restricted. Also shown below (Figure 6C) is HDD-2 hybridization to a dot blot 
with numerous human pA+ mRNA samples (Clontech). HDD-2 hybridization is only 
visible in kidney (the other "spots" are background). It was confirmed that HDD-2 
corresponds to a single-copy human gene by genomic Southern blot (Figure 6D). In 
summary, the results of this very low throughput human sequencing effort substantiate 
the overall approach; that judicious pre-enrichments and selections will result in rapid 
identification of biologically interesting and often novel genes. Most importantly, these 
studies firmly establish the existence of genes whose expression correlates with the most 
primitive stem cell phenotype. 

Other data were generated using (1) a murine stromal cell line to support enriched 
human stem/progenitor cells, and (2) tetracycline regulatable retroviral expression 
vectors. The stromal cell line AFT024 is efficient in long-term, in vitro maintenance of 
LTRA in purified murine fetal liver and adult bone marrow populations (Terstappen, 
L.W.M.M., et al. 1991) . Additional data demonstrated highly-efficient retroviral- 
mediated gene-transfer into murine LTRA as well as into primitive in vitro progenitors 
during the AFT024 cocultures. Further, it has been shown that AFT024 is very effective 
in supporting ELTCIC. Specifically, the CD34+CD38- immunophenotype as well as the 
functional capacity of these cells is maintained. The latter was measured in limiting- 
dilution replating experiments. Moreover, it has been shown that limiting numbers of 
human CD34+Lin- cells can give rise to both B cells and NK-cells when cultured on 
AFT024. However, the supportive activities of AFT024 on mouse and human 
stem/progenitor cells have been indistinguishable. In addition, more than 500 sequences 
have been analyzed from an AFT024-specific subtracted cDNA library. A number of 
candidate stem cell regulators have been identified. Three of these are dlk (Moore, K.A., 
et al. 1997), a novel BMP/TGF-β superfamily member and a novel selectin-related 
molecule. The present invention contemplates identifying human stem/progenitor cell 
receptors and/or ligands for the AFT024 specific proteins. 

In order to facilitate functional studies of stem cell gene products several retroviral 
gene-transfer vector systems were constructed and characterized. All of these employ the 
293T cell retroviral packaging system (Kinsella, T., and Nolan, G. 1996). High titers of 
virus can be produced transiently without the time and labor consuming effort required 
for stable producer cell lines. Large cDNA populations can also be converted into virus 
populations (Kitamura, T., et al. 1995). It is preferable to have inducible (or repressible) 
vectors which are also selectable. Also, the single-transcription unit tetracycline (tet) 
repressible vector was modified (Hofmann, A., et al. 1996). This vector includes an 
enhancer/promoter deletion in the 3' LTR. The tet system is currently one of the best 
inducible expression systems available; regulation over a several hundred-fold range of 
expression is observed (Shokett, P.E., and Schatz, D.G. 1996) . Cloning sites were 
introduced in order to insert cDNAs in a sense or antisense orientation. A green 
fluorescent protein (GFP) marker was also included in these vectors driven by a 
thymidine-kinase promoter (TK-GFP) (Yang, T., et al. 1996; Cheng, L., and Kain, S. 
1996). The cDNA fragment of interest is under tet regulation while GFP is constitutively 
expressed in transduced cells. In order to confirm this, NIH3T3 cells are infected with a 
LacZ virus and GFP+ cells sorted. The GFP+ cells express LacZ in the absence of tet, 
while LacZ expression is undetectable in most cells after the addition of tet. The titer of 
this model construct is approximately 10^4/ml, which is suitable for tissue culture studies. 



Table 1. 

Known genes (or extensive aa homology):           436 (50.5%) 
    by nucleotide:                                 407 
    by amino acid only:                             29 
    to mouse:                                      230 
    to human/other:                                206 
    Grouped by function 
        Signaling/receptors:                       133 
        Translational/post-:                        62 
        Structural:                                 49 
        Transcriptional/post-:                      45 
        Cell fate:                                  35 
        Other:                                      87 
        Unknown function:                           25 

Novel genes:                                      427 (49.5%) 
    Homologous only to expressed sequence tags:   288 (33.3%) 
    No homology to any known nt or aa sequences:  139 (16.2%) 
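
The counts in Table 1 can be cross-checked with a few lines of arithmetic. The sketch below simply re-adds the figures given in the table; the variable names are illustrative, and the assignment of the functional groups to the known-gene class is an inference from the fact that they sum to the same total.

```python
# Counts transcribed from Table 1
known_by_match = {"by nucleotide": 407, "by amino acid only": 29}
known_by_species = {"to mouse": 230, "to human/other": 206}
known_by_function = {
    "Signaling/receptors": 133,
    "Translational/post-": 62,
    "Structural": 49,
    "Transcriptional/post-": 45,
    "Cell fate": 35,
    "Other": 87,
    "Unknown function": 25,
}
novel = {"EST homology only": 288, "no homology": 139}

known_total = sum(known_by_match.values())
novel_total = sum(novel.values())
grand_total = known_total + novel_total


def percent(part, whole):
    """Percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


print("known:", known_total, percent(known_total, grand_total))
print("novel:", novel_total, percent(novel_total, grand_total))
print("total sequences:", grand_total)
```

Each of the three breakdowns of the known genes adds up to the same subtotal, which is why the functional grouping is read here as applying to the known genes only.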
Putative family                 Clone    Method         Cells compared             Notes 
------------------------------  -------  -------------  -------------------------  ---------------------------------- 
7-transmembrane receptor        Cyt-28   diff. display  SC* fresh vs. 7d-cultured  secretin R superfamily 
Methyl transferase              Cyt-19   diff. display  SC fresh vs. 7d-cultured 
                                C3-54    subtraction    SC vs. AA4- 
Aspartyl protease               SA7      subtraction    SC vs. AA4- 
Signal transduction molecules   SA6f     subtraction    SC vs. AA4-                UUR family member (?) 
                                LL2-02   subtraction    SC vs. AA4-                RTV associated 
beige-related proteins          DD116    diff. display  AA4+ vs. AA4-              contains WD repeat 
Transcriptional regulators      SA49P1   subtraction    SC vs. AA4-                polycomb homol. 
                                LL5-96   subtraction    SC vs. AA4-                includes LIM domain 
G-protein signaling             LU-39    subtraction    SC vs. AA4-                cell cycle transition 
                                B1-66    subtraction    SC vs. AA4-                similar to BL34 
Apoptosis-related genes         SBSA56   subtraction    SC vs. AA4-                SMT3A-related 
                                LL5-68   subtraction    SC vs. AA4-                sim. to Requiem transcript. factor 
Chromatin proteins              C4-23    subtraction    SC vs. AA4-                yeast HST2-like 
                                C3-25    subtraction    SC vs. AA4-                NHP2-like 
                                LL2-89   subtraction    SC vs. AA4-                yeast SIS2-like 
Other stem cells (intestinal    B3-77    subtraction    SC vs. AA4-                homologous to A4 gene 
crypts)                         C2-48    subtraction    SC vs. AA4-                homologous to C 101 
                                LL2-76   subtraction    SC vs. AA4-                homologous to EDPF 
Genes involved in leukemogenic  LL5-03   subtraction    SC vs. AA4-                t(1;11)(q21;q23) 
translocations                  B2-67    subtraction    SC vs. AA4-                t(X;U)(q28;q11) 
Homologues of Drosophila        LL205    subtraction    SC vs. AA4-                germ cell-less 
developmental genes             LLM2     subtraction    SC vs. AA4-                brainiac 
                                C4-80    subtraction    SC vs. AA4-                cornichon 
                                P4.|4                   SC vs. AA4-                discs-large 

*The designation "SC" refers to sorted day 14 fetal liver cells having the phenotype AA4.1+Lin-/loSca+ckit+. 

EXAMPLE 2: Molecular cloning and characterization of AA4, an early 
marker of hematopoietic development 

In this example, the expression cloning and molecular characterization of AA4, a 
surface marker expressed on hematopoietic stem and progenitor cells, is described. The 
results demonstrate that AA4 is a 130kDa type I glycosylated membrane protein whose 
structural organization suggests a role in cell adhesion. Expression analysis showed that 
high levels of AA4 are found in lung, heart, and bone marrow. It is not found in 
undifferentiated ES cells, but its expression is upregulated as these cells differentiate into 
colonies of hematopoietic precursors and endothelial cells. In the hematopoietic system, 
expression of AA4 correlates with the expression of stem cell markers CD34 and CD43. 
Functional studies revealed that AA4 coimmunoprecipitates with CD34 and CD43, 
suggesting that these proteins form a macromolecular complex which functions in the 
regulation of cell adhesion, proliferation and/or differentiation of hematopoietic cells. 
Introduction 

Monoclonal antibody AA4.1 was first described more than a decade ago 
(McKearn et al., 1984) and since then it has become a useful marker for the isolation and 
analysis of hematopoietic cells (McKearn et al., 1985; Jordan et al., 1990; Fujimoto et al., 
1996). A number of studies have shown that the antigen recognized by AA4.1 is present 
on a subset of primitive hematopoietic progenitors found at various stages of development 
in sites of active hematopoiesis in yolk sac (Cumano et al., 1993; Auerbach et al., 1996; 
Yoder et al., 1997), AGM region (Godin et al., 1995; Marcos et al., 1997), fetal liver 
(McKearn et al., 1985; Jordan et al., 1990; Cumano and Paige, 1993), and bone marrow 
(Li et al., 1996; Szilvassy and Cory, 1993). In yolk sac, AA4-positive cells are first 
detected at day 8-10 of gestation (Cumano et al., 1993; Sanchez et al., 1996). At day 9-10 
of gestation AA4+c-Kit+ progenitors are found in the P-Sp/AGM region (Sanchez et al., 
1996; Marcos et al., 1997), and by day 14 of development, AA4 defines 0.5-1.0% of the 
fetal liver tissue that contains the entire hierarchy of primitive hematopoietic cells (Jordan 
et al., 1990). Proliferation within each successive compartment results in increased total 
number of progenitor cells. Antigen density per cell also increases with developmental 
progress, which is especially marked for c-Kit and AA4 (Marcos et al., 1997). In bone 
marrow, HSC are found in both AA4+ and AA4- subpopulations, although in adult 
marrow AA4 is largely regarded to be a marker of early B lymphoid lineage. In addition, 
recent studies demonstrate that the expression of AA4 parallels the onset of hematopoietic 
development in differentiating ES cells (Kabrun et al., 1997; Ling and Neben, 1997; 
Potocnik et al., 1997). Taken together, these results indicate that AA4 plays an important 
role in hematopoiesis and warrants more detailed study. 

In order to achieve this goal, AA4 was molecularly cloned and characterized. The 
results demonstrate that AA4 is a 130kDa type I transmembrane glycoprotein whose 
structure suggests a role in cell adhesion. Expression analysis showed that high levels of 
AA4 are found in lung, heart, and bone marrow. In the hematopoietic system, expression 
of AA4 correlates with the expression of stem cell markers CD34 and CD43. Functional 
studies indicate that AA4 coimmunoprecipitates with CD34 and CD43, suggesting that 
these proteins form a macromolecular complex which may function in the regulation of 
cell adhesion, proliferation and/or differentiation of hematopoietic cells. 
Results 

Cloning of AA4. In order to identify AA4, AA4.1 monoclonal antibody was used 
to screen various cell lines and primary hematopoietic cells. The murine B lymphoid cell 
line D2N was found to express relatively high levels of AA4 antigen (see Table 3). 
Immunoprecipitation of protein extracts prepared from D2N cells showed that AA4.1 
recognizes a protein with apparent molecular weight (Mr) 130kDa (Figure 7). This 
protein was also present in extracts prepared from B lymphoid cell line M2.4, and 
AA4-positive hematopoietic cells derived from bone marrow and fetal liver. 

To isolate AA4, a cDNA library was prepared from D2N cells and cloned into the 
polylinker region of a retroviral expression vector REBNA (see Materials and Methods). 
Following production of retroviruses, NIH 3T3 cells were infected with the recombinant 
cDNA library and selected for AA4 expression by flow cytometry using PE-conjugated 
AA4.1. After two rounds of sorting, genomic DNA extracted from AA4-positive cells 
was analyzed by PCR amplification using viral vector primers. This resulted in the 
amplification of a 3.1 kbp cDNA which was gel-purified and subcloned for further 
analysis. Infection of NIH 3T3 fibroblasts or EML C1 hematopoietic cells with 
REBNA/AA4, a recombinant retrovirus expressing the cloned cDNA, led to the 
acquisition by cells of high affinity for AA4.1 mAb (Figure 8A). In infected cells, AA4.1 
detects a 130kDa surface protein which comigrates with the endogenous AA4 from D2N 
cells (Figure 8B), thus indicating that the cloned cDNA encodes AA4. 

Sequence analysis of AA4. Sequence analysis of the cloned cDNA (SEQ. ID 
No.: 72) showed that it has a single open reading frame encoding a protein of 644 amino 
acids (see Figure 9) (SEQ. ID No.: 73). The deduced amino acid sequence includes a 
putative leader peptide and the mature protein which starts at position 20. The protein 
contains a long N-terminal extracellular region, a single putative hydrophobic 
transmembrane region, and 47 amino acids of the C-terminal cytoplasmic domain. The 
extracellular part of AA4 is composed of two major structural moieties. The N-terminal 
region contains a C-type lectin domain (CTL) which has 32% sequence homology to 
endothelial cell receptor thrombomodulin. This region is followed by a cysteine-rich 
domain composed of six epidermal growth factor (EGF)-like repeats, three of which are 
consistent with the calcium-binding EGF motifs. Similar repeats are found in the 
extracellular domains of a large number of membrane-bound proteins and in proteins 
known to be secreted (Bork et al., 1996). 
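
The residue counts given above fit together as simple arithmetic, which is sketched below. Only the three input figures (644 residues in the open reading frame, a mature protein starting at position 20, and a 47-residue cytoplasmic domain) come from the text; the variable names are illustrative, and the length of the transmembrane segment is not stated, so it is left combined with the extracellular region.

```python
TOTAL_RESIDUES = 644        # open reading frame, including the leader peptide
MATURE_START = 20           # first residue of the mature protein
CYTOPLASMIC_RESIDUES = 47   # C-terminal cytoplasmic domain

leader_length = MATURE_START - 1                 # residues 1..19
mature_length = TOTAL_RESIDUES - leader_length   # residues 20..644

# Extracellular region plus the single transmembrane segment; the text does not
# give the transmembrane length, so the two are not separated here.
extracellular_plus_tm = mature_length - CYTOPLASMIC_RESIDUES

print("leader peptide:", leader_length)
print("mature protein:", mature_length)
print("extracellular + transmembrane:", extracellular_plus_tm)
```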

Database searches revealed that AA4 exhibits high homology to C1qRP, the 
human receptor for complement component C1q expressed on surfaces of cells of the 
myeloid lineage and endothelial cells (Nepomuceno et al., 1997). Sequence alignment 
showed that AA4 and C1qR have approximately 68% identical amino acid positions and 
similar domain structures. Highest homologies were found within the N-terminal parts of 
the proteins and their C-terminal cytoplasmic domains (see Figure 10A and 10B), suggesting 
that AA4 and C1qR may share functional properties. 

The amino acid sequence of AA4 contains numerous potential O-linked and N-linked 
glycosylation sites. Although the predicted Mr of AA4 is 67.4 kDa, two protein bands 
exhibiting Mr 105kDa and 130kDa, respectively, are immunoprecipitated by AA4.1 mAb 
in cells infected with REBNA/AA4 (Figure 8C). When 35S-labeled cells are chased with 
nonradioactive media, the intensity of the 105kDa band rapidly decreases, while the 
intensity of the 130kDa band increases, thus indicating that p130 is the mature form of the 
protein, whereas p105 is its precursor. In agreement with this conclusion, 
immunoprecipitation of biotinylated cells using AA4.1 reproducibly results in the 
detection of a 130kDa surface protein (see Figure 8B). 
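
The gap between the predicted polypeptide mass and the apparent masses of the two bands can be made explicit with a short calculation. This is only a rough sketch: the three masses are those quoted above, the names are illustrative, and apparent Mr on a gel is not a direct measure of true mass, so the "excess" is at best an indication of the extent of post-translational modification such as glycosylation.

```python
PREDICTED_KDA = 67.4  # predicted Mr of the AA4 polypeptide

# Apparent Mr of the immunoprecipitated bands
APPARENT_KDA = {"p105": 105.0, "p130": 130.0}


def excess_over_predicted(apparent_kda, predicted_kda=PREDICTED_KDA):
    """Return (excess in kDa, excess as a fraction of the apparent Mr)."""
    excess = apparent_kda - predicted_kda
    return excess, excess / apparent_kda


for band, apparent in APPARENT_KDA.items():
    excess, fraction = excess_over_predicted(apparent)
    print(f"{band}: {excess:.1f} kDa above predicted ({fraction:.0%} of apparent Mr)")
```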

Expression patterns of AA4. Northern blot analysis showed that in adult mouse 
tissues, AA4 is expressed at high levels in lung, heart, and bone marrow. No detectable 
expression was found in brain, testis, spleen, and thymus (Figure 10B). In normal tissues 
and transformed cell lines, a 7kb RNA species hybridizes with the cloned cDNA (Figure 
11A and 11B). In addition to the 7kb species, poly(A)-RNA from D2N cells contains a 
minor band corresponding to a 3.2 kb mRNA (see Figure 11A, lane 8). Similarity search 
against expressed sequence tags (ESTs) showed that databases contain at least seven 
different sequences corresponding to the 3'-untranslated region of the cloned aa4 cDNA 
which were isolated from the following mouse tissues: colon (Genbank accession No. 
AA929174), heart (AA435107), lymph node (AA185911 and AA267407), lung 
(AA220480), mammary gland (AI021507), and spleen (AA145088). Sequence identity 
with the ESTs is lost upstream of nucleotide G at position 2481 in the 3'-untranslated 
region of aa4 cDNA, thus indicating that the cloned cDNA corresponds to an alternatively 
spliced 3.2 kb aa4 mRNA. 

Expression of AA4 in hematopoietic progenitor cells. AA4 is produced in murine 
hematopoietic progenitor cells and immature B lymphocytes found at various stages of 
development in yolk sac (Godin et al., 1995; Marcos et al., 1997), fetal liver (McKearn et 
al., 1985; Jordan et al., 1990), and bone marrow (Cumano et al., 1992; Li et al., 1996). To 
confirm that the cloned cDNA encodes AA4, RT-PCR was performed on hematopoietic 
cells fractionated using several different techniques. The cells analyzed included 
AA4+ and AA4- fetal liver (FL) cells; AA4+ FL cells fractionated into Lin(lo)c-Kit+Sca-1+ 
and Lin(lo)c-Kit+Sca-1- populations; and hematopoietic progenitors isolated from bone 
marrow (BM) by cell sorting using combinations of different surface markers. 

Figure 12A shows that aa4 was amplified from AA4+ FL cells, whereas in AA4- 
cells it was only present at low levels. AA4 expression was high in Lin(lo)c-Kit+Sca-1+ 
cells enriched for HSC activity. In adult marrow cells, aa4 was amplified from Lin+ cells 
and Lin-c-Kit+Sca-1+CD34+ multipotential progenitors. At lower levels aa4 was present 
in Lin-c-Kit+Sca-1+CD34- long term reconstituting HSCs (Figure 12B). In all tested cells 
aa4 expression correlated with the expression of stem cell markers CD34 and CD43. 

Embryonic stem (ES) cells which have been shown to generate progenitors for 
most hematopoietic lineages during differentiation in vitro (Keller, 1995) were also tested 
in this experiment. Figure 12C shows that aa4 was not found in undifferentiated ES cells 
but its expression was upregulated as these cells differentiated and formed blast cell 
colonies (BL) and colonies of more differentiated hematopoietic cells (HMT). These 
results are in line with previous studies which showed that AA4 is expressed in ES-derived 
hematopoietic precursors (Kabrun et al., 1997; Ling and Neben, 1997; Scott et al., 
1997) and show that this expression is not abrogated upon differentiation of blast cell 
colonies into endothelial cells (ENT in Figure 12C). 

Ectopic expression of AA4 has no mitogenic effect. EML C1 and NIH 3T3 cells 
expressing exogenous AA4 did not exhibit morphological changes indicative of 
alterations in their growth properties. To examine the effect of AA4 on growth 
phenotypes in more detail, NIH 3T3 and primary mouse embryo fibroblasts were infected 
with a retrovirus expressing AA4 and maintained in high and low serum conditions. 
FACS analysis of the transduced cells confirmed that in each case the efficiency of 
infection was close to 100%. However, examination of growth rates showed that 
overexpressed AA4 had no apparent effect on proliferation of either cell type as compared 
to control uninfected cells or the corresponding cells infected with a retrovirus expressing 
GFP. 

AA4 coimmunoprecipitates with CD34 and CD43. To investigate interactions 
with other proteins, extracts of hematopoietic and fibroblast cells expressing AA4 were 
immunoprecipitated with AA4.1 mAb and then examined by immunoblot analysis using a 
panel of antibodies directed against membrane-associated proteins. This analysis revealed 
that AA4 coimmunoprecipitates with CD34, a membrane glycoprotein selectively 
expressed within the hematopoietic system on stem and progenitor cells, and CD43, which 
is a major O-glycosylated sialomucin found on the surfaces of most leukocytes. This 
result is in line with previous studies which showed that CD43 coimmunoprecipitates 
with the human C1qRP (Guan et al., 1991; 1994). Figure 13B shows that in the murine 
D2N and EML C1 cells, a 52kDa protein is the major isoform of CD43 that associates 
with AA4. This 52kDa protein was found to be reactive with both the N-terminal (S19) 
and C-terminal (M19) anti-CD43 antibodies, indicating that it is not a breakdown product. 
In NIH 3T3 fibroblasts coexpressing CD43 and AA4, 54kDa and 170kDa CD43 
isoforms coimmunoprecipitated with AA4, pointing to glycosylation differences 
between CD43 expressed in different cell types. A 115kDa CD43 isoform which 
has previously been shown to be sialylated and thus overly negatively charged (Guan et 
al., 1994) did not form macromolecular complexes with AA4 in NIH 3T3 cells. Similarly, 
AA4 did not coimmunoprecipitate with a 115kDa CD43 isoform found in D2N cells and 
a 120kDa isoform found in EML C1 (Figure 13B). 

To examine associations of AA4 with CD34, both proteins were expressed in Rat-1 
cells following infection with the corresponding retroviruses. See Figure 13C. 

In Rat-1 cells coexpressing AA4 and CD34, the results obtained with AA4.1 also 
demonstrate that AA4 coimmunoprecipitates with CD34. Studies have suggested a role for 
both CD34 and CD43 in the regulation of adhesion, growth and differentiation of hematopoietic 
precursors (Ardman et al., 1992; Bazil et al., 1997; Chen et al., 1996; Suzuki et al., 1996; 
Wood et al., 1997; Stockton et al., 1998). Coexpression of AA4-Fc fusion protein in NIH 
3T3 cells stably producing AA4 showed that AA4-Fc forms heterodimeric complexes 
with AA4. Figure 11A shows that equimolar amounts of AA4 and AA4-Fc were 
coprecipitated from these cells by protein A, indicating that AA4 is prone to homo- or 
heterodimerization. 
Discussion 

This Example describes the expression cloning and sequence analysis of AA4, a
molecular marker expressed on hematopoietic stem and progenitor cells. The cDNA
encoding AA4 was isolated from a retroviral cDNA library prepared from the murine
D2N lymphoid cell line. Sequence analysis of the cloned cDNA revealed that AA4 is a
type I transmembrane protein composed of 625 amino acids. The extracellular part of the
molecule contains two major structural moieties, a C-type lectin carbohydrate recognition
domain and six EGF-like domains. Similar repeats have been found in a large number of
membrane-bound proteins or in proteins known to be secreted. The cytoplasmic domain
of AA4, in contrast, bears no structural similarity with known protein families. Instead,
AA4 revealed strong homology to C1qR, the human receptor for complement component
C1q, which is predominantly expressed in phagocytic cells such as monocytes, neutrophils,
and endothelial cells (Nepomuceno et al., 1997). Sequence alignment shows that AA4
and C1qR have approximately 68% identical amino acid positions and share a similar
domain structure. Highest homologies were found within the N-terminal parts of the two
proteins and their C-terminal cytoplasmic domains, suggesting that AA4 and C1qR may
have similar functional properties. Expression of AA4 correlates with the expression of
CD34 and CD43, two other surface markers normally present on murine hematopoietic
stem and progenitor cells. Previous studies have shown that AA4-positive cells are first
detected in yolk sac at day 8-10 of gestation (Cumano et al., 1993; Sanchez et al., 1996).

At day 9-10 pc, AA4+c-Kit+Mac-1+ progenitors are found in the P-Sp/AGM
region (Sanchez et al., 1996; Marcos et al., 1997), and by day 14 of gestation, AA4
defines 0.5-1.0% of the fetal liver tissue that contains the entire hierarchy of primitive
hematopoietic cells (Jordan et al., 1990). As proliferation within each successive
compartment results in an increased total number of progenitor cells, antigen density per
cell also increases with developmental progress, which is especially marked for c-Kit and
AA4 (Marcos et al., 1997). A significant proportion of hematopoietic progenitors from
yolk sac also express CD34, CD38, CD43, CD44, and Mac-1 but show little or no
expression of CD4, CD8, CD45R and Sca-1. These same markers are present on FL-derived
HSC, which also express Sca-1. Adult marrow HSCs also express c-Kit, CD38 and Sca-1,
but do not normally express Mac-1 or AA4. In BM, HSC are found in both AA4+ and
AA4- subpopulations, although in adult marrow AA4 is largely regarded to be a marker of
early B lymphoid lineage. These results indicate that the expression of cell surface
antigens changes on HSC during ontogeny and that differential display of these cell
surface markers may reflect the relationship between HSCs that contribute to multilineage
hematopoiesis and distinct anatomical sites during development.

Methods

Cells and tissue culture. NIH 3T3 fibroblasts were grown in DME medium
supplemented with 10% fetal calf serum (FCS). D2N cells were grown in RPMI 1640
medium containing 10% FCS. EML C1 cells were grown in IMDM supplemented with
20% horse serum and 8% BHK/MKL conditioned medium (Tsai et al., 1994). To
maintain the multipotentiality of EML C1, the cells were kept at low density
(0.5-5 x 10^5/ml) and subcultured every two days. Cell lines constitutively expressing
AA4 and GFP were derived from NIH 3T3 or EML C1 cells by infecting with the
corresponding recombinant retroviruses.

Retroviral-mediated gene transfer. Retrovirus expression vector REBNA was
constructed by substituting the LacZ gene contained within the EcoRI-NotI fragment of
plasmid LZRSPBMN-Z (Kinsella and Nolan, 1996) with a synthetic polylinker composed
of EcoRI, XhoI, SfiI, and NotI sites. Retroviral vector REBNA/IRESGFP contains a
poliovirus IRES element and the cDNA encoding color-enhanced GFP (S. Zolotukhin,
Gainesville, FL) inserted into REBNA.

For DNA transfections, cells were plated at a density of 2 x 10^6 cells per 60mm dish
and transfected with 5ug of plasmid DNA using 20ul of lipofectamine reagent
(GibcoBRL). REBNA-transfected cells were selected in puromycin (2ug/ml) and grown
to confluence prior to collecting virus supernatant. For infections, the culture medium
was supplemented with polybrene (Sigma) at 5ug/ml. The appropriate virus was added
and incubated overnight. Multiple infections were carried out sequentially, with the
appropriate selection between.

Isolation of RNA and cDNA cloning. Poly(A) RNA prepared from D2N cells
was converted into cDNA using Superscript II Reverse Transcriptase (GibcoBRL) and an
oligo(dT) primer containing a NotI site, 5'-TGGTGTCGACGCAGAGTAGCGGCCGC(T)18
(SEQ.ID.No.:74). The second strand was synthesized using DNA polymerase I in
combination with E. coli RNAse H and E. coli DNA ligase as described (Gubler and
Hoffman, 1983). An adaptor composed of complementary oligonucleotides,
5'-GGCCCGGGCCGGCC (SEQ.ID.No.:75) and 5'-TCGAGGCCGCCCGGGCC
(SEQ.ID.No.:76), was ligated to the cDNA, which was then cut with NotI to produce cDNA
molecules with NotI and XhoI termini for directional cloning. After size fractionation in
agarose gel, cDNAs larger than 2.5 kbp were ligated into NotI and XhoI cut plasmid
REBNA and electroporated into electrocompetent DH12S cells (GibcoBRL). Plasmid
DNAs were transfected into a 293-derived packaging cell line for retrovirus production.
Virus-containing supernatants were collected and stored at -80°C.

NIH 3T3 cells infected with the recombinant retroviruses representative of the D2N
cDNA library were selected for the production of AA4 by flow cytometry using
phycoerythrin-conjugated AA4.1 mAb. After two rounds of sorting, genomic DNA
isolated from AA4-positive cells was subjected to PCR amplification using retroviral
vector primers, 5'-CAGCCCTCACTCCTTCTC (SEQ.ID.No.:77) and
5'-GGTGGGGTCTTTCATTCC (SEQ.ID.No.:78) (Kitamura et al., 1995). Amplified
cDNA was gel purified and subcloned into pBluescript SK and REBNA plasmid vectors.
Nucleotide sequences were analyzed using NCBI Blast database search programs and
the ExPASy molecular biology server from the Swiss Institute of Bioinformatics.

Northern blot hybridization. RNAs prepared using the acid guanidinium thiocyanate-phenol
extraction procedure (Chomczynski and Sacchi, 1987) were separated on
formaldehyde-agarose gels and blotted onto Hybond-N nylon membranes
(Amersham). Hybridization probes were derived from cloned cDNAs using Ready To
Go DNA labeling beads (Pharmacia Biotech). Hybridizations were performed as
described previously (Petrenko et al., 1997).

Cell labeling and immunoprecipitations. The ECL protein biotinylation system
(Amersham) for the detection of cell surface proteins was used as recommended by the
manufacturer. For radioactive labeling, 2 x 10^6 cells were incubated with 200uCi
Translabel (ICN) in 2ml of cysteine- and methionine-deficient medium for 2hr at 37°C.
Cells were washed in PBS and lysed for 10 min on ice in 500ul of NP40 buffer containing
20mM Tris-HCl pH 7.6; 150mM NaCl; 0.5% NP40; 1mM PMSF; 5mM benzamidine;
1mM sodium vanadate; 10ug/ml aprotinin; 10ug/ml leupeptin. Lysates were cleared by
centrifugation and 2ug of soluble antibody were added to the supernatants followed by
25ul of Protein G-Sepharose suspension (Sigma). After 2 to 4hr incubation at 4°C with
rotation, protein G-antibody complexes were pelleted and washed in successive
changes of wash buffer 1 (1M NaCl; 10mM Tris-HCl pH 8; 0.1% NP40); wash buffer 2
(0.1M NaCl; 10mM Tris-HCl pH 8; 0.1% NP40); wash buffer 3 (10mM Tris-HCl pH 8;
0.1% NP40). Samples were eluted by boiling for 2 min in 2X gel loading buffer, separated by
SDS-PAGE, dried and exposed to X-ray film.

Western blot analysis. Protein extracts for Western blot analysis were prepared as
described (Morrison et al., 1991). The antibodies used included goat anti-mouse CD43
polyclonal IgG (M19 and S-19, Santa Cruz) in combination with HRP-conjugated
secondary antibodies and the ECL detection system (Amersham).

Flow cytometry and RT-PCR. Timed-pregnant mice and 5- to 7-week-old female
mice (C57BL/6J) were purchased from Jackson Laboratory (Bar Harbor, ME).
AA4-positive cells were isolated from day 14 fetal liver by immunopanning on Petri dishes
coated with AA4.1 antibody (10ug/ml). Hematopoietic stem cells were purified from the
AA4-positive fraction by staining with lineage-specific antibodies as described previously
(Moore et al., 1997). Three-color fluorescence-activated cell sorting for lineage negative
to low, Sca-1(+), c-Kit(+) cells was performed on a multilaser FACS Vantage with
CellQuest software (Becton Dickinson). ES cells differentiated into blast cell colonies,
hematopoietic progenitors, and endothelial cells were prepared as described previously.

For RT-PCR, poly(A)-RNA isolated from sorted cells was converted into cDNA
using Superscript II Reverse Transcriptase (GibcoBRL) and the CapFinder cDNA
amplification kit (Clontech). Gene-specific primers for PCR amplification included:
5'-TTCAGCAAGCCCTGACTC (SEQ.ID.No.:79) and 5'-GCCACCTTCGAAGCAATC
(SEQ.ID.No.:80) (AA4); 5'-GAGCGGTACAGGAGAATG (SEQ.ID.No.:81) and
5'-GCCCACCCAACCAAATCA (SEQ.ID.No.:82) (CD34);
5'-ACCGCGTTCTTCTGTAAC (SEQ.ID.No.:83) and 5'-CAGCTAACAGCAGGATCC
(SEQ.ID.No.:84) (CD43); and the G3PDH Control Amplimer Set (Clontech) for the
amplification of GAPDH.
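
As a quick sanity check on the gene-specific primer pairs listed above, the following Python sketch tabulates each primer's length, GC content, and an approximate melting temperature. The Wallace rule (Tm = 2(A+T) + 4(G+C)) is an assumption introduced here for illustration only and is not part of the described method; the names PRIMERS, gc_fraction, and wallace_tm are likewise hypothetical.

```python
# Gene-specific RT-PCR primers as listed in the text (written 5' to 3').
# The text does not say which primer of each pair is the sense primer,
# so they are labeled simply "first" and "second".
PRIMERS = {
    "AA4": ("TTCAGCAAGCCCTGACTC", "GCCACCTTCGAAGCAATC"),
    "CD34": ("GAGCGGTACAGGAGAATG", "GCCCACCCAACCAAATCA"),
    "CD43": ("ACCGCGTTCTTCTGTAAC", "CAGCTAACAGCAGGATCC"),
}


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Approximate Tm in degrees C by the Wallace rule (an assumption here):
    2 degrees per A/T base plus 4 degrees per G/C base."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for gene, pair in PRIMERS.items():
        for label, seq in zip(("first", "second"), pair):
            print(f"{gene:5s} {label:6s} {seq} "
                  f"len={len(seq)} GC={gc_fraction(seq):.2f} Tm~{wallace_tm(seq)}")
```

A check of this kind only flags gross mismatches within a pair; it says nothing about primer specificity or the cycling conditions actually used.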
EXAMPLE 5: IN VITRO MAINTENANCE OF HIGHLY PURIFIED,
TRANSPLANTABLE HEMATOPOIETIC STEM CELLS

The cellular and molecular mechanisms which regulate even the most primitive
hematopoietic stem cell are not well understood. This example details a systematic
dissection of the complex hematopoietic microenvironment in order to define some of these
mechanisms. An extensive panel of immortalized stromal cell lines from murine fetal liver
was established and characterized. Collectively, these cell lines display extensive
heterogeneity in their in vitro hematopoietic supportive capacities. This example describes a
long-term in vitro culture system, utilizing a single stromal cell clone (AFT024), that
qualitatively and quantitatively supports transplantable stem cell activity present in highly
purified populations. Disclosed is multi-lineage reconstitution in mice that received the
equivalent of as few as 100 purified bone marrow and fetal liver stem cells which were
cultured for 4-7 weeks on AFT024. The cultured stem cells meet all functional criteria
currently ascribed to the most primitive stem cell population. The levels of stem cell activity
present after 5 weeks of coculture with AFT024 far exceed those present in short-term
cytokine-supported cultures. In addition, the maintenance of input levels of transplantable
stem cell activity is accompanied by the expansion of other classes of stem/progenitor cells.
This suggests that the stem/progenitor cell population is actively proliferating in culture and
that the AFT024 cell line provides a milieu which stimulates progenitor cell proliferation
while maintaining in vivo repopulating activity.
Introduction 

Mammalian blood formation originates in a small population of hematopoietic stem
cells. The hallmark features of these cells are: (1) a hierarchical multilineage differentiation
potential with the ability to clonally give rise to at least 8 distinct cell lineages, (2) self-renewal
capacity which is reflected in the lifelong continuous activity of a few, in some cases
single, stem cells and (3) a dramatic proliferative potential which is ultimately responsible
for the production of large numbers of mature blood cells (Lemischka, I.R. 1992;
Morrison, S.J. et al. 1995; Harrison, D.E. 1980). During the past decade much progress has
been made in providing a physical phenotype for this rare population of stem
cells (Spangrude, G.J., et al. 1995; Lemischka, I.R. 1992; Bauman, J.G., et al. 1998).
However, currently, the only reliable functional assay system for the most primitive stem
cell compartment is long-term in vivo transplantation. No in vitro system has been
developed which adequately recapitulates stem cell behaviors. Therefore, the cellular and
molecular mechanisms that regulate the biology of stem cells have remained obscure.

A major challenge in stem cell research is the establishment of culture systems which 
facilitate in vitro maintenance of long-term transplantable stem cell activity. This is a 
necessary first step towards a cellular and molecular understanding of the regulatory 
mechanisms which mediate commitment versus self-renewal decisions. Moreover, the 
establishment of such culture systems is a prerequisite for the potential expansion of 
undifferentiated stem cell populations as well as for the generation of stem/progenitor cells 
committed to selected lineages. 

Efforts to develop culture systems for the maintenance of transplantable stem cells
can be subdivided into two broad categories: (1) those utilizing defined cytokine
combinations as the only culture supplements and (2) those relying on a pre-established
stromal monolayer as an additional supportive component (with or without exogenously
added cytokines). Both of these strategies have met with only limited success. In the first
case, it has been repeatedly demonstrated that combinations of cytokines can exert potent
stimulatory effects on stem/progenitor populations (Spangrude, G.J., et al. 1988; Jordan,
C.R. et al. 1990; Fleming, W.H., et al. 1993). In some studies, highly purified stem cells (Li,
C.L., and Johnson, G.R. 1990; Spangrude, G.J. and Johnson, G.R. 1990) have been used
and the direct effects of cytokines have been demonstrated at the single cell level (Jones, R.
et al. 1990; Uchida, N. et al. 1993). While informative, the vast majority of these studies are
limited by their strictly in vitro nature. Thus, it is feasible to expand replatable in vitro
progenitor populations (Li, C.L., and Johnson G.R. 1992; Uchida, N. et al. 1993) and to
stimulate colony formation by cells with both myeloid-erythroid and lymphoid potentials
(Jones, R., et al. 1990; Uchida, N. et al. 1993; Trevisan, M., and Iscove, N.N. 1995; Ogawa,
M. 1993); however, the equivalence of these progenitor cells with the in vivo transplantable
stem cell population remains speculative. Several studies have clearly demonstrated a
dramatic loss of in vivo repopulating potential as a result of cytokine driven in vitro
proliferation (Knobel, et al., 1994; Peters, et al., 1995; Traycoff, et al., 1996). A small
number of studies have shown that defined cytokine combinations promote the maintenance
of transplantable activity (Rebel, et al., 1994). However, most of these are limited by
the use of very short culture periods, the exact nature of the in vivo assay, and the use of
non-enriched stem cell sources (Muench, et al., 1993; Holyoake, et al., 1996; Soma, et al.,
1996). This precludes interpretations suggesting a direct action of the given cytokine(s) in
maintaining transplantable activity.

A further complication with defined cytokine studies is the inability to ascribe in vivo
physiological relevance to the observed effects. It has long been accepted that in the intact
animal, stem cells are found in close association with discrete cellular
microenvironments (Lord, et al., 1975; Trentin, et al., 1970; Weiss, et al., 1991; Wolf, 1979).
These observations suggest both the existence of stem cell niches and the notion that in vivo
stem cell regulatory mechanisms are likely to require cell-cell contact or short range
interactions (Dorshkind, 1990). Efforts to understand the features of the hematopoietic
microenvironment began with the establishment of the Dexter long-term culture (LTC)
system (Dexter, et al., 1977). In this culture system hematopoiesis is maintained for weeks
or months by a heterogeneous adherent cell monolayer derived from bone marrow (BM).
While some degree of transplantable stem cell maintenance and self renewal (Fraser, et al.,
1990) has been demonstrated, a general feature of the Dexter-LTC is a dramatic net decrease
of stem cell activity over time (Harrison, et al., 1987; Van der Sluijs, et al., 1993). Although
much progress has been made, especially in studies of human stem/progenitor cells
(Sutherland, et al., 1989; Verfaillie, et al., 1995; Hao, et al., 1996), a further drawback of this
system is the heterogeneity of the stromal cell types present in the supportive monolayer.
This hampers the identification of regulatory mechanisms. Studies have been reported
where the heterogeneous stromal monolayer is replaced with cloned stromal cell lines
(Roberts, et al., 1987; Kodama, et al., 1984; Issad, et al., 1993; Wineman, et al., 1993). Many
of these cell lines can support in vitro myelopoiesis (Suzuki, et al., 1992; Neben, et al.,
1993; Kodama, et al., 1992), B-lymphopoiesis (Collins, et al., 1987; Whitlock, et al., 1982),
or in some cases both (Wineman, et al., 1993; Wineman, et al., 1996). However, very few
studies have focused on the in vitro maintenance of the most primitive transplantable stem
cell compartment. Moreover, with one exception (Szilvassy, et al., 1996), the studies which
have focused on the in vitro maintenance of this stem cell population begin with
heterogeneous unpurified sources of hematopoietic activity (Wineman, et al., 1992, 1996;
Deryugina, et al., 1994). Such populations contain numerous non-hematopoietic stromal cell
types. Therefore, it has not yet been possible to assign a direct stem cell supporting
phenotype to a given stromal cell line.

It was hypothesized that the rare frequency of primitive stem cells may suggest an
equally rare frequency of stem cell supporting microenvironmental niches. Accordingly, we
established and characterized a large panel of conditionally immortalized, cloned stromal
cell lines from mid-gestation fetal liver. This organ was chosen because, during
development, it is here that the stem cell compartment is undergoing self-renewal expansion
in addition to differentiation (Moore, et al., 1970). The cell lines were generated as
previously described (Wineman, et al., 1996) by immortalization with a temperature
sensitive SV40TAg (Frederiksen, et al., 1988). The clonal nature of the AFT024, 2018, and
2012 cell lines was verified by Southern blot analysis, which detected a single, unique
proviral integration locus in their genomic DNA. In order to identify potentially interesting
cell lines, a "cobblestone area" (CA) assay (Ploemacher, et al., 1989) was initiated
with BM taken from mice injected two days previously with 5-fluorouracil (5-FU). It has
been suggested that CA colonies which appear after a prolonged culture period are derived
from more primitive stem cells, possibly identical to some in vivo transplantable entities
(Ploemacher, et al., 1991). Therefore, a goal was to identify cell lines which support such
late arising CAs. Of 225 lines, 77 (34%) were capable of supporting limited in vitro
hematopoiesis, while, consistent with the initial hypothesis, only 2% were able to maintain
long-term (>6 weeks) hematopoietic CA activity. Subsequent studies with a selected subset
of these lines showed that the ability to effectively support in vivo reconstituting BM stem
cells is infrequently observed (Wineman, et al., 1996). Two out of sixteen cell lines
maintained significant levels of long-term reconstituting stem cell activity for an in vitro
culture period of three weeks. Several other cell lines supported low levels of such activity
or transiently repopulating stem cells. The cell inoculum in these studies was whole BM
which was not enriched for stem cell activity. Therefore, it was not possible to suggest that
the effective stromal cell lines were directly supporting stem cell activity. This example
demonstrates that a single clonal cell line, designated AFT024, can maintain quantitative
levels of transplantable stem cell activity present in highly purified stem cell populations.
These data were generated using a competitive repopulation assay system, which employs
uncompromised competitor BM cells. The in vitro-maintained stem cells satisfy all criteria
which currently define the most primitive stem cell population, including the ability to
reconstitute secondary recipients. This example also shows that the in vitro maintenance of
primitive transplantable stem cells is compatible with the concurrent generation of large
numbers of committed progenitors.
Methods

Mice. Timed-pregnant mice and 5-7 week old females (C57BL/6J, Ly5.2) were
purchased from the Jackson Laboratory (Bar Harbor, ME). Congenic C57BL/6, Ly5.1
female mice were purchased from the National Cancer Institute (Frederick, MD). All mice
were housed in the Princeton University Barrier Animal Facility, in autoclaved
micro-isolator cages on ventilated cage racks. The animals received sterile, irradiated food, and
acidified, autoclaved water ad libitum.

Stromal cell lines and culture conditions. The fetal liver stromal cell lines used in
this study were derived as previously described (Wineman, et al., 1996). Stromal cell lines
were routinely cultured in Dulbecco's modified Eagle's medium (DMEM) supplemented
with 10% fetal bovine serum (FBS) and 5 x 10^-5 mol/L β-mercaptoethanol (2-ME), at 32°C,
5% CO2, 100% humidity. Sera were obtained from Hyclone, Logan, UT. Other biochemical
reagents were obtained from Sigma, St. Louis, MO. Two of the lines used in this study
(2012 and 2018) were previously characterized for their ability to support long-term
repopulating activity present in whole unfractionated BM (Wineman, et al., 1996). The
AFT024 cell line was identified as an additional long-term (>4 weeks) CA supporter.
Subclones of 2012 and AFT024 were isolated and used in these studies. The AFT024 cell
line has remained stable and demonstrated consistent stem cell supporting abilities for over 4
years.

Hematopoietic stem cell purification. Stem cells were purified from day 14 fetal
livers essentially as described (Jordan, et al., 1990), with the inclusion of c-kit expression
as an additional parameter. Briefly, AA4.1+ cells were isolated by immunopanning on
petri dishes coated with AA4.1 antibody (10 ug/mL). The AA4.1+ fraction has been
shown to contain all repopulating stem cell activity present in day 14 fetal liver (Jordan,
et al., 1990). AA4.1+ cells were collected and stained with saturating concentrations of
fluorescein isothiocyanate (FITC) labeled rat monoclonal antibodies to lineage markers
(CD3, CD4, CD5, CD8, B220, Gr-1, Mac-1, and TER-119). The cells were
simultaneously stained with phycoerythrin (PE) labeled Ly6A/E (Sca-1) antibody and
biotinylated antibody to c-kit. The latter was developed with streptavidin
allophycocyanin (APC). The AA4.1 hybridoma was a kind gift from Dr. J. McKearn,
Monsanto, St. Louis, MO. AA4.1 antibody was purified by ImClone Systems Inc., New
York, NY. The TER-119 antibody was initially obtained from Dr. T. Kina, Kyoto
University, Japan and subsequently purchased from PharMingen, San Diego, CA. All
other antibodies were purchased from PharMingen. Streptavidin APC was purchased
from Molecular Probes Inc., Eugene, OR. Three color fluorescence activated cell sorting
for lineage negative to low (lin-/lo), Sca-1+, c-kit+ cells was initially done on a dual laser
Epics 753 cell sorter (Coulter Electronics, Hialeah, FL) interfaced with Cicero software
(Cytomation Inc., Fort Collins, CO) and subsequently on a multi-laser FACS Vantage
with CellQuest software (Becton Dickinson Immunocytometry Systems, San Jose, CA).

Enriched fractions of BM stem cells were obtained from C57BL/6 Ly5.1 mice as
described (Okada, et al., 1992). Briefly, BM mononuclear cells were isolated by density
centrifugation over Ficoll Hypaque (<1.077, Pharmacia, Piscataway, NJ). Lineage
negative or low staining cells (lin-/lo) were obtained by magnetic bead depletion (anti-rat
immunoglobulin coated beads, Dynal, Oslo, Norway) of mononuclear cells using the
same lineage cocktail described herein above. The cells were further stained with
antibodies to Sca-1 and c-kit as described herein above. Sorting for lin-/lo, Sca-1+, and
c-kit+ cells was accomplished with the Epics 753 as described above.

Stem cell/stromal cell cocultivation and cytokine-supplemented suspension
culture. Stromal cell lines were seeded on tissue culture dishes that had been coated with
1% gelatin (Specialty Media, Lavallette, NJ) and were grown at 32°C, 5% CO2, 100%
humidity. Confluent monolayers were irradiated (20 Gy, 137Cesium source, Gammacell
40, Nordion International Inc., Ontario, Canada) and cultured in modified Dexter (Dexter,
et al., 1984) media (DMEM, 10% FBS, 10% horse serum, 5 x 10^-5 mol/L 2-ME, 1 x 10^-7
mol/L hydrocortisone). For Dexter-LTC, enriched hematopoietic stem cells were added
and the cultures were maintained at 37°C, 5% CO2, 100% humidity with weekly media
changes. The specific numbers of purified stem cells added to stromal cell cocultures are
given in the appropriate figure or table legends. In some experiments, week 4
AFT024/stem cell cultures were harvested and replated in limiting-dilution onto fresh,
irradiated (20 Gy) AFT024 monolayers in 96-well trays (Dexter-LTC conditions). CAs
were scored weekly (as described above) for an additional 5 weeks. Irradiated (20 Gy)
2018 monolayers in 96-well trays were used in limiting-dilution Whitlock-Witte assays
(LD-WW) (Whitlock, et al., 1982) to assess the stromal-dependent B-lymphopoiesis content
of both freshly purified and AFT024 cultured fetal liver stem cells. These cultures were
established in RPMI media with 5% FBS, 2 mmol/L glutamine, 1 mmol/L Na pyruvate,
and 5 x 10^-5 mol/L 2-ME at 37°C, 5% CO2, 100% humidity. 2018 has been identified as
a potent B-lymphopoiesis supporting line in W-W conditions (Deryugina, et al., 1994).
Short-term cytokine-supported suspension cultures and short-term AFT024/stem cell
cocultures were established in Iscove's Modified Dulbecco's Media (IMDM), 10% FBS,
1% BSA, 5 x 10^-5 mol/L 2-ME at 37°C, 5% CO2, 100% humidity. Cytokine
concentrations: rmflk2/flt3-ligand (FL) 30 ng/mL, rmSteel factor (SL) 20 ng/mL, rhIL-6
10 ng/mL. FL was obtained from ImClone Systems Inc.; SL was purchased from
Genzyme Corporation, Cambridge, MA. IL-6 was purchased from Upstate
Biotechnology Inc., Lake Placid, NY.

Transplantation assays for hematopoietic stem cell activity. Competitive repopulation
was used to measure stem cell activity present in both freshly isolated and cultured stem
cell populations (Harrison, et al., 1993). This assay was performed using the congenic
Ly5.1/5.2 mouse system (Morse, et al., 1987). Enriched stem cells were seeded onto
irradiated stromal monolayers and maintained in Dexter-LTC conditions. At the end of 4-7
weeks, the cultures were harvested by vigorous trituration. Single cell suspensions
were prepared by passage through 22-gauge needles, mixed with fresh congenic BM and
transplanted into lethally irradiated congenic mice (10 Gy, split dose 3 hours apart, 1
Gy/min, Gammacell 40). All purified fetal liver stem cells were from Ly5.2 mice. The
competitor BM and recipients were Ly5.1. Purified BM cells were from Ly5.1 mice; in
this experiment, Ly5.2 BM was used as competitor and Ly5.2 mice were used as
recipients. In order to assess reconstitution, mice were periodically bled by capillary
puncture of the orbital venous plexus. Blood (0.1 mL) was collected into heparin-containing
(10 U/mL) DMEM and the red blood cells were lysed with NH4Cl (Mishell, et
al., 1980). For the experiments described in Figure 15 and Table 4, the nucleated cells
were divided into two fractions and stained with the appropriate biotinylated Ly5
antibody, developed with streptavidin-peridinin chlorophyll protein (PerCP), together with
either (1) CD4-FITC and CD8-PE or (2) B220-FITC, Mac-1-PE, and Gr-1-PE. Cells from
each fraction were analyzed on an Epics Profile II, Coulter Electronics. For the experiments
described in Figure 16 and Tables 2 and 3, nucleated cells were stained with directly conjugated
lineage antibodies (CD4-PE, CD8-PE, Mac-1-FITC, Gr-1-FITC, and B220-APC) and
biotinylated Ly5.2 antibody which was developed with streptavidin Texas Red (T.R.).
Four color analysis of stained cells was performed on either the Coulter Epics 753 or
Becton Dickinson FACS Vantage with the appropriate software interfaces described
above. Anti-Ly5.1 was a kind gift of Dr. H. Nakauchi, University of Tsukuba, Japan.
Purified and biotinylated Ly5.2 antibody was originally obtained from a hybridoma 






(AL14A2) kindly provided by Dr. G. Spangrude, University of Utah Medical Center, Salt
Lake City, UT. In later experiments, the Ly5.2 antibody was purchased from
PharMingen; CD4-PE, CD8-PE, and B220-APC were also purchased from PharMingen.
Streptavidin-PerCP was purchased from Becton Dickinson Immunocytometry Systems.
Streptavidin T.R. was purchased from Molecular Probes. Competitive repopulating units
(CRU) per 10^5 cells were calculated according to the formula (Harrison, et al., 1990):

\[
\mathrm{CRU}/10^{5} = \frac{\%\ \text{Ly5 positive cells}}{100 - \%\ \text{Ly5}} \times \frac{\text{cell number competitor BM}}{\text{number of test cells}}
\]

Reconstitution values of less than 2% of the test Ly5 donor allele were not considered 
sufficiently above background for calculation of CRU. 
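
The CRU calculation and the 2% reporting threshold described above can be sketched in Python as follows. This is a minimal illustration, not the authors' analysis code: the function name cru_per_1e5 and its parameter names are hypothetical, and it assumes that the competitor BM cell number and the test cell number are expressed in the same units (absolute cell counts).

```python
from typing import Optional


def cru_per_1e5(percent_test_ly5: float,
                competitor_cells: float,
                test_cells: float,
                min_percent: float = 2.0) -> Optional[float]:
    """Competitive repopulating units per 10^5 test cells.

    percent_test_ly5: percentage of cells carrying the test donor Ly5 allele.
    competitor_cells: number of competitor BM cells co-transplanted.
    test_cells:       number of test cells transplanted (same units).
    Returns None when reconstitution is below min_percent, mirroring the
    statement that values under 2% were not used to calculate CRU.
    """
    if not 0.0 <= percent_test_ly5 < 100.0:
        raise ValueError("percent_test_ly5 must be in [0, 100)")
    if competitor_cells <= 0 or test_cells <= 0:
        raise ValueError("cell numbers must be positive")
    if percent_test_ly5 < min_percent:
        return None
    # (%Ly5 / (100 - %Ly5)) x (competitor cell number / test cell number)
    ratio = percent_test_ly5 / (100.0 - percent_test_ly5)
    return ratio * (competitor_cells / test_cells)


if __name__ == "__main__":
    # Hypothetical numbers, for illustration only.
    print(cru_per_1e5(50.0, 4e5, 1000))
    print(cru_per_1e5(1.0, 4e5, 1000))
```

Returning None rather than 0 for sub-threshold values keeps "not calculable" distinct from "no activity", which matches the wording of the threshold statement.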

Retransplantation potential of stem cells was assessed by secondary
transplantation. Mice from the experiment presented in Figure 14 were sacrificed 60
weeks after transplantation; BM was harvested and stained with antibody to Ly5.2
followed by streptavidin T.R. Ly5.2+ cells were collected by cell sorting (Coulter Epics
753) and used to transplant lethally irradiated secondary Ly5.1 recipients. Marrow from
mice in the Control and AFT024 groups was used to transplant mice in both
radioprotection and competitive repopulation assays. Ly5.2+ BM from the primary 2012
transplants was used only in radioprotection assays. Primary 2018 mice did not contain
sufficient Ly5.2+ cells for secondary transplantation. Transplanted mice were bled and
analyzed by 4-color flow cytometry for the presence of Ly5.2+ cells and multilineage
reconstitution as described above.

In vitro hematopoietic progenitor cell assays. The progenitor content of freshly
purified hematopoietic stem cell populations and AFT024/stem cell cocultures was
assessed using a variety of in vitro assays. All of the following assays were performed
with fetal liver stem cells enriched as described herein above. To determine the time
course of CA development, enriched stem cells were seeded onto irradiated AFT024
monolayers (300-600 cells/well in 12-well trays). CA development was followed over
time and characteristic clusters were quantitated as described above. At different time
points of the stem cell/AFT024 cultures, individual wells were harvested and replated into
cytokine-supplemented semisolid clonogenic progenitor assays (CFU-C). The cytokine-enriched
(rmIL-3 10 ng/mL, rhIL-6 10 ng/mL, rmSL 50 ng/mL, Epo 3 U/mL) methylcellulose
mixture was purchased from Stem Cell Technologies Inc., Vancouver BC,
Canada. Colonies were scored after 8-14 days of culture at 37°C, 5% CO2, 100% humidity
according to established criteria (Testa, et al., 1993). Colonies that reached >1 mm in
size after 8 days and which contained erythroid bursts and multiple myeloid cell lineages
including megakaryocytes were scored as high-proliferative potential-mixed lineage
colonies (CFU-HPP-Mix) (Lowry, et al., 1995). Lineage content of typical colonies was
determined by Wright's/Giemsa staining of cytospin slide preparations from individual
colonies. Colony assays were also done with 10^3 freshly purified cells. The CFU
progenitor contents of the AFT024 cocultures were normalized to an initial input of 10^3
stem cells. To assess the ability of the AFT024 cell line to maintain primitive lymphoid
progenitors, 4 week cocultures were plated into LD-WW assays on 2018 cells as described
above. Resulting pro-B cell colonies were scored after 7 days. The cell number in
individual wells (96-well trays, 8 wells/cell number) was normalized to the original
number of purified stem cells that initiated the coculture, i.e. stem cell equivalents/well.
As calculated from the line of best fit, the cell number at 37% negative wells gives the
frequency (expressed as 1 in N cells) of pro-B cell colony initiating cells in the starting
population (Taswell, 1981).
In a similar manner, the frequency of CA initiating cells in week 4 stem cell/AFT024
cocultures was also determined by replating them in limiting-dilution onto fresh,
irradiated AFT024 monolayers in 96-well trays. CA were scored as described above, at
1, 2, 3, 4, and 5 weeks after replating in Dexter-LTC. The resulting frequencies were
calculated as described herein above for the LD-WW assays and are also expressed in
relationship to the number of stem cells that seeded the initial cultures (stem cell
equivalents).
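
The 37% negative-well readout follows from the single-hit Poisson model, in which the fraction of negative wells at a given cell dose is exp(-f x dose) and therefore equals 1/e (about 37%) when the dose is 1/f. The Python sketch below illustrates that relationship. It assumes an ordinary least-squares fit through the origin on log-transformed data, which is a simplification and not necessarily the fitting procedure used in the cited method; the function name and the data are hypothetical.

```python
import math


def one_in_n_from_lda(doses, negative_fractions):
    """Estimate N such that 1 in N cells is a colony-initiating cell.

    Fits ln(fraction negative) = -f * dose by least squares through the origin
    and returns 1/f, the dose at which about 37% of wells are negative.
    Doses with no negative wells are dropped because log(0) is undefined.
    """
    pairs = [(d, math.log(p)) for d, p in zip(doses, negative_fractions) if p > 0.0]
    sxx = sum(d * d for d, _ in pairs)
    sxy = sum(d * y for d, y in pairs)
    if sxx == 0 or sxy >= 0:
        raise ValueError("data do not define a decreasing fit")
    frequency = -sxy / sxx  # fitted slope is -f
    return 1.0 / frequency


# Hypothetical data: stem cell equivalents per well and fraction of negative wells.
doses = [5, 10, 20, 40]
negative = [math.exp(-d / 11.0) for d in doses]  # synthetic, generated with 1 in 11
print(round(one_in_n_from_lda(doses, negative), 2))  # 11.0
```

Because the synthetic fractions are generated from the model itself, the fit recovers the input value; real well counts would scatter around the line.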
Results 

In vivo and in vitro assays for stem cell activity maintained by stromal cell lines.
Highly enriched stem cell populations were used to initiate cultures supported by single
stromal cell lines. The focus was on one cell line, AFT024, which exhibited particularly
potent stem/progenitor cell support. Two other stromal cell lines, 2012 and 2018
(Wineman, et al., 1996), were included in some experiments. In order to more rigorously
establish the clonality of these lines, they were subcloned by limiting-dilution. All
subclones obtained from a given cell line contained the same proviral integrant position
as the parental cell line. The AFT024 cell line was evaluated both for its ability to
maintain in vivo competitive repopulating stem cells and for its ability to maintain a
broad spectrum of stem/progenitor cells defined by a variety of in vitro assays. The in
vivo assays focused on the ability of stem cells, cultured for extended time periods
(4-7 weeks), to permanently reconstitute multi-lineage hematopoiesis in transplanted
hosts. The in vitro assays included the enumeration of CA appearing over time in the
initial cultures as well as the quantitation of stem/progenitor cells which can form
colonies in cytokine-supplemented replating assays. Cells from four-week AFT024/stem cell
cocultures were also assayed by limiting-dilution for the content of progenitors capable
of initiating secondary CAs on AFT024 or B-lymphopoiesis in Whitlock-Witte cultures
supported by 2018.

AFT024 maintains quantitative levels of long-term in vivo repopulating stem cell
activity. One line of investigation asked whether, and at what levels, in vivo transplantable
stem cell activity was present in four to seven week-old cultures initiated with highly
enriched stem cells and supported by AFT024, 2012 or 2018. Purified day 14 fetal liver
cells (AA4.1+, lin-/lo, Sca-1+, c-kit+) and adult BM cells (lin-/lo, Sca-1+, c-kit+) were used as
sources of stem cell activity. Both of these populations are about 1000 to 1500-fold
enriched for stem cell activity, as measured by competitive repopulation (Harrison, et al.,
1993). The Ly5.1/Ly5.2 congenic system was utilized for all competitive repopulation
studies (Morse, et al., 1987). The data presented in Figure 14 demonstrate that the
cultures supported by AFT024 contain stem cell activity at levels quantitatively identical
to those present in the uncultured purified populations. In this experiment, individual
Ly5.1 mice received 10^3 freshly purified Ly5.2 cells or the cultured equivalent of 10^3
purified Ly5.2 cells. Each mouse also received 10^6 Ly5.1 competitor BM cells. The
percentage of Ly5.2 positive peripheral blood cells was approximately equal in both
groups of recipient animals. Moreover, the cultured stem cell activity is as effective as
freshly purified activity for in vivo periods of greater than one year. The data in Figure 14
also show that the 2018 cell line is completely ineffective in maintaining highly purified
stem cell activity while the 2012 cell line supports intermediate levels of repopulating
activity. The data presented in Table 4A provide quantitative competitive repopulating
unit (CRU) value calculations as well as the results of multiparameter lineage analyses.
The extremely low levels of reconstitution by 2018-cultured stem cells precluded lineage
analysis. The CRU values of the AFT024-cultured and freshly purified populations are
nearly identical. Moreover, both fresh and AFT024-cultured stem cells reconstitute
myeloid and lymphoid cell populations to a similar degree. In order to further assess the
supporting activities of AFT024 and 2012, we utilized a 10-fold lower number of fetal
liver stem cells from two separate purifications to initiate the cocultures. The cultures
were continued for 4-7 weeks, harvested, and used in competitive repopulation studies.
Each recipient received the cultured equivalent of 100 purified Ly5.2 stem cells plus
4 x 10^5 Ly5.1 competitor BM cells. A total of twelve mice were transplanted with
AFT024 cocultures (four each after four, five and seven weeks of LTC). The parental
AFT024 line was used in the 4 week group and two different subclones were used to
support the five and seven week cultures. The AFT024-cultured Ly5.2 stem cells
contributed to 20-30% of peripheral blood cells in these recipients while cells cultured on
2012 demonstrated more limited in vivo function (Table 4B). The 2012 cultures were
done with two subclones of the parental line and were maintained for four weeks prior to
harvest and transplant (four mice/subclone). The data obtained with different culture times or
subclones did not vary significantly from each other and are presented together in Table
4B. An additional experiment was undertaken using enriched BM which was cultured on
AFT024 and 2018 for six weeks. In this experiment, BM was purified from Ly5.1
congenic mice. Each Ly5.2 recipient mouse in this study received 100 freshly purified
cells, or the cultured equivalent of 100 purified cells. Both groups received 10^5 Ly5.1
competitor BM cells per mouse. Data analysis for the presence of Ly5.1+ cells at 4
months after transplant is presented in Table 4C. For 6 weeks of culture, AFT024 cells
maintained quantitative levels of the reconstituting activity present in 100 purified BM stem
cells. The 2018 cell line failed to maintain stem cell activity.

These studies were extended to include secondary transplantation as an additional
assay for primitive stem cells. BM cells were harvested from the primary recipients of
fresh and cultured fetal liver stem cells (see Figure 14 and Table 4A) and the Ly5.2
positive, fetal liver-derived fraction was collected by cell sorting. Secondary
radioprotection and competitive repopulation transplants were performed. The data are
presented in Table 5. The secondary recipient repopulating activities are nearly identical
for the AFT024 cultured stem cells and the non-cultured controls. Lineage analysis of the
Ly5.2 cells in the secondary recipients revealed similar numbers of myeloid and
lymphoid cells derived from both AFT024-cultured and non-cultured stem cells. Some
level of secondary reconstituting cell activity was observed.

Additional experiments were also performed to determine how the levels of
stem cell activity present in long-term AFT024 cocultures compared to those present
in short-term cytokine-stimulated cultures or in short-term AFT024-supported cultures
(Figure 15). Purified fetal liver cells were seeded onto an AFT024 monolayer and
maintained in Dexter-LTC conditions for 5 weeks. Simultaneously, the same numbers of
purified cells were cultured for 5 days (1) with different cytokine combinations or (2) on
AFT024. The transplantable activity in the cultured cells was then assayed by
competitive repopulation. Each mouse received the cultured equivalent of 600 stem cells
together with 4 x 10^5 Ly5.1 competitor BM cells. It is evident from the data that the levels
of in vivo repopulating activity present in the long-term AFT024-supported cultures are
much greater than those remaining after a short-term cytokine-supported culture period.
Of interest also is that short-term AFT024 stem cell cocultures do not maintain significant
in vivo reconstituting activity. In fact, these levels of stem cell activity are identical to
the levels seen in the cytokine-supported cultures.

In vitro stem/progenitor populations are expanded by AFT024. AFT024/stem cell
cocultures have vigorous hematopoiesis throughout the entire in vitro culture period.
This is reflected in the large numbers of relatively mature hematopoietic cells which are
produced throughout the culture period. In addition, CA colonies are observed
throughout the culture period. Figure 16 shows a time course of CA appearance with
purified fetal liver stem cells (3 separate experiments). After 28 days in culture,
approximately one in every twenty input stem cells is capable of proliferating into a CA.
In addition, CA appearance over time follows a biphasic distribution, with many CA
observed early in the culture period. In order to enumerate the various classes of
stem/progenitor cells present in AFT024 cocultures, we performed a series of in vitro
replating experiments. These included the quantitation of: (1) progenitor cells capable
of colony-formation in cytokine-supplemented semisolid cultures (CFU assay), (2)
progenitor cells capable of initiating secondary CA in limiting-dilution AFT024 cultures
and (3) progenitor cells which can initiate B-lymphopoiesis in LD-WW cultures. In all
of these experiments the primary cultures were initiated with purified fetal liver cells. All
data presented below are normalized to an initial input of 10^3 purified cells (CFU assay)
or the actual number of initial input stem cells (limiting-dilution assays).
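
The two normalizations mentioned above amount to simple scaling. The Python sketch below shows one way to express them; the function names and the first set of numbers are hypothetical, while the second call uses the 500-cell, 20% split described in the legend to Table 4B.

```python
def cfu_per_1000_input(colonies, fraction_plated, input_cells):
    """Scale a colony count from a replated aliquot to 10^3 input stem cells."""
    if not 0.0 < fraction_plated <= 1.0:
        raise ValueError("fraction_plated must be in (0, 1]")
    if input_cells <= 0:
        raise ValueError("input_cells must be positive")
    colonies_per_culture = colonies / fraction_plated
    return colonies_per_culture * 1000.0 / input_cells


def stem_cell_equivalents(input_cells, fraction_used):
    """Number of originally seeded stem cells represented by a culture aliquot."""
    return input_cells * fraction_used


# Hypothetical well: 600 input cells, one quarter of the harvest replated, 45 colonies.
print(round(cfu_per_1000_input(45, 0.25, 600), 1))  # 300.0
# 20% of a culture seeded with 500 stem cells corresponds to 100 equivalents.
print(round(stem_cell_equivalents(500, 0.2), 1))    # 100.0
```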

Shown in Figure 17 are the numbers and types of cytokine responsive CFU
progenitors present at various times in the AFT024-supported cocultures. Production of
CFU is evident at all time points. However, the content is especially high after four
weeks, representing a 5-7 fold increase/expansion when compared to the content in the
freshly purified populations. The content of more primitive progenitors (CFU-HPP-Mix)
is increased by 12-fold. These HPP-Mix colonies often reach a size of 2 mm in 8 days
and contain large numbers of erythroid bursts and megakaryocytes. Interestingly, there
does not appear to be a correlation between CA number and CFU content at different
culture times. This is most apparent at day 6, when CA numbers are at their peak but the
progenitor content is similar to that observed in non-cultured stem cells. Furthermore,
there is no correlation between CFU content and the absolute numbers of maturing
hematopoietic cells present in a given culture.

Next, the content of primitive B-lymphoid progenitors present in the
AFT024/stem cell cultures was determined. This was accomplished by plating cells from
the four-week AFT024 cocultures into LD-WW assays over the 2018 stromal cell line.
Two experiments with freshly purified stem cells and AFT024-cultured stem cells
showed that the frequency of pro-B cell progenitors is expanded 10-fold in AFT024
cultures compared to the frequency observed in the freshly purified input population (day
0 frequency 1 in 11.0, r^2 = 0.98; day 28 AFT024-cultured frequency 1 in 1.1, r^2 = 0.97).
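
The 10-fold figure follows directly from the two limiting-dilution frequencies. As a minimal check, the Python sketch below computes the fold change between two "1 in N" values; the function name is an illustrative assumption.

```python
def fold_change(one_in_before, one_in_after):
    """Fold change in frequency when a '1 in N' value goes from before to after."""
    if one_in_before <= 0 or one_in_after <= 0:
        raise ValueError("'1 in N' values must be positive")
    return one_in_before / one_in_after


# Frequencies quoted in the text: 1 in 11.0 at day 0 and 1 in 1.1 at day 28.
print(round(fold_change(11.0, 1.1), 1))  # 10.0
```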

In order to measure the content or frequency of progenitor cells capable of
initiating secondary CA, 4 separate, four-week AFT024/stem cell cocultures were
replated in limiting-dilution onto fresh AFT024 monolayers. CAs were scored after one
week. The data are presented in Figure 18A. Large numbers of secondary CAs were
observed. When normalized to the stem cell numbers used to initiate the primary cultures
(stem cell equivalents, see Methods), the frequency of these progenitors is 1 in 3 to 4.
Figure 18B shows data from one of the 4 experiments presented in Figure 18A, where
the quantitation of secondary CAs was extended for 4 more weeks. The frequency of CA
decreases slowly over time (1 in 19 after an additional 4 weeks), approximating the
frequency seen in the primary cultures at four weeks. In summary, our in vitro replating
assays collectively demonstrate a significant expansion of primitive progenitor
populations in 4 week AFT024 cultures. In these same cultures there is no decrease in the
levels of transplantable stem cell activity present in the total hematopoietic cell
population.

Discussion

In this example, it was demonstrated that the AFT024 stromal cell line can
maintain quantitative levels of in vivo repopulating stem cells for at least 7 weeks of in
vitro culture. Highly enriched stem cell populations in low numbers (100 cells) were
used and cell activity was measured in a stringent, competitive repopulation assay
system. The cultured stem cell activity satisfies all in vivo criteria normally ascribed to
the most primitive stem cell compartment: (1) long-term engraftment ability, (2)
multilineage potential and (3) the ability to repopulate secondary recipients. In addition,
the studies with the low culture initiating stem cell numbers for both BM and fetal liver
imply that AFT024 stromal cells exert their supportive effects in a direct manner. These
studies represent a clear example of an in vitro system capable of directly supporting the
most primitive stem cell compartment.

Other data revealed that stromal cell lines isolated from a single tissue source are
heterogeneous with respect to their abilities in maintaining long-term repopulating stem
cells (Wineman, et al., 1996). It was speculated that the rare cell lines which were
effective in supporting in vivo reconstituting stem cells may represent immortalized
components of in vivo stem cell niches. However, the data argue for the necessity of
using purified stem cell populations in order to support such a hypothesis. Specifically,
the 2018 cell line maintained transiently reconstituting activity present in unfractionated
BM. However, 2018 fails to maintain measurable repopulating activity when cultured for
4-6 weeks with highly purified BM or fetal liver stem cells. Similarly, in experiments
using purified cells, there was a failure to show robust levels of reconstituting stem cell
activity in cultures supported by CFC034, the most effective cell line in the whole BM
studies (Wineman, et al., 1996). The 2012 cell line, which was reported (Wineman, et al.,
1996) to be effective in maintaining long-term repopulating whole BM derived stem cells,
is only partially effective in the present studies. Moreover, only some subclones of 2012
display such activity (in spite of identical proviral integration positions in all subclones).
Recently, studies have shown that the S17 cell line, which consistently supports the stem
cell activity present in whole BM (Wineman, et al., 1993), is not similarly effective in the
maintenance of purified BM stem cells (Szilvassy, et al., 1996). Taken together with
these current data, the previously observed stem cell supporting stromal cell activities
may reflect the action(s) of indirect mechanisms and therefore do not permit the
identification of cellular stem cell niche components. One previous study has shown that
the Sys1 stromal cell line can maintain high levels of transplantable activity present in
purified BM (Szilvassy, et al., 1996). The competitor cells in that study were
compromised by prior serial transplantation. Moreover, the culture period was extended
for only two weeks and effective maintenance required the addition of exogenous
leukemia inhibitory factor. In contrast, AFT024 is a cell line that provides a direct-acting
long-term stem cell supporting environment without the addition of exogenous factors.

In addition to recovering net input levels of transplantable activity from AFT024
supported cultures, significantly expanded populations of primitive progenitor cells were
also obtained. CFU-HPP-Mix progenitors are expanded by 12-fold after 4 weeks of
culture and the numbers of stromal-dependent pro-B lymphoid progenitors are similarly
amplified. This suggests that the AFT024-mediated process of stem cell maintenance is
in reality a dynamic phenomenon. Specifically, during the first portion of the culture
period, the majority of transplantable stem cell activity may be lost through
differentiation or cell death. The remaining primitive stem cells may expand to yield
input levels of transplantable activity as well as increases in the numbers of more
committed progenitors. One hypothesis is that short-term AFT024 supported cultures
should contain reduced levels of transplantable stem cell activity. Figure 17 supports this
hypothesis. The standard Dexter-type media used in parallel long-term AFT024
cocultures was not used in these short-term cultures. However, in another short-term
experiment, utilizing Dexter-LTC media, a similarly dramatic reduction in stem cell
activity was observed after 4 days of culture on AFT024 (63). These observations are
intriguing because they suggest that the AFT024 cell line is able to facilitate some degree
of ex vivo transplantable stem cell proliferation and expansion. Indeed, in other studies it
was shown that AFT024 can support colony formation initiated by single purified stem
cells with B and T-lymphoid, myeloid and erythroid potentials. Moreover, the data
suggest that it is possible to efficiently introduce retroviral markers into transplantable stem
cells at various times during AFT024 cocultures. Extension of such marking experiments
and an analysis of proviral integration patterns will be necessary to rigorously ascertain if
self-renewal replication is occurring during these coculture periods.

The ability of AFT024 to maintain the most primitive stem cell compartment
while generating and expanding at least some less primitive members of the
stem/progenitor cell hierarchy raises interesting issues regarding the nature of stem cell
niches. The present invention suggests that microenvironmental niche models which
postulate distinct cellular entities responsible for stem cell self-renewal and other cellular
entities which support the generation of committed progenitor cells may be overly
simplified (Uchida, et al., 1993). Quite clearly a single microenvironmental cell type
represented by AFT024 is sufficient for keeping stem cells in an undifferentiated state as
well as allowing commitment and progenitor expansion to take place. A hallmark feature
of a stem cell niche is the ability to facilitate the generation of the entire stem/progenitor
cell hierarchy from very primitive cells. Therefore, the main functional role of such
niches may be to provide an environment which permits the production of the correct
numerical balance of more and less primitive stem/progenitor cell entities. This model
contains several testable hypotheses. The most important is that in vitro stem cell
maintenance should not be interpreted, literally, as the maintenance of quiescent cells but
rather as a phenomenon which results from a balance of self-renewal and commitment
decisions which occur during stem cell division.

A cytokine cocktail of IL-6, SL, and FL is not effective in maintaining fetal liver
stem cell activity. It was shown that RNA transcripts for these and 10 other cytokines are
present in AFT024, but they are also detected at similar levels in non-supporting lines
such as 2018 (Wineman, et al., 1996). These observations suggest the existence of novel
AFT024-derived molecules which may act on stem cells. Indeed, using a subtractive
hybridization molecular cloning strategy, a number of candidate molecules have been
identified. Two of these molecules contain EGF-like repeat motifs that are most closely
related to those found in the Notch/Notch-ligand family. Interestingly, one of these
molecules appears to have activity on primitive stem cell populations (Moore, et al.,
1997) (see Examples presented herein above).

Table 4. Multilineage stem cell activity, high CRU levels maintained in AFT024 cultures

A. With 1000 purified fetal liver cells

Lineage contribution (CD4, CD8, B220, Myeloid) is given as % Ly5.2 cells.

Cells    Week  CD4       CD8       B220      Myeloid   Total % Ly5.2  CRU/10^5
Control  5     3.3±1.5   4.4±1.7   48±5.1    32±6.0    28±1.3         386±24
Control  12    27±10     24±8.7    63±8.9    43±9.2    44±3.2         800±102
Control  24    42±6.9    34±4.1    71±4.4    62±1.4    48±4.8         984±171
AFT024   5     7.7±6.2   6.0±3.5   58±6.6    36±4.7    39±2.1         641±54
AFT024   12    44±7.1    38±6.4    70±5.6    53±3.6    48±1.4         942±51
AFT024   24    56±3.8    49±3.1    72±2.4    64±3.7    56±1.9         1270±96
2012     5     0.8±0.5   1.4±0.3   6.8±3.6   14±11     7.4±2.3        52±8.5
2012     12    9.1±8.4   10±6.9    14±9.3    23±20     14±4.5         194±68
2012     24    17±3.8    18±2.0    23±5.9    32±6.4    21±4.4         196±71

B. With 100 purified fetal liver cells

Lineage contribution (Granulocytes, B-cells, T-cells) is given as % Ly5.2 cells.

Cells    Week  Granulocytes  B-cells   T-cells   Total % Ly5.2  CRU/10^5
AFT024   15    33±4.6        41±6.0    25±5.3    28±4.6         1888±315
AFT024   26    17±4.4        18±4.8    18±5.9    24±5.9         1825±571
AFT024   46    22±5.7        19±6.6    15±6.7    20±6.5         1595±628
2012     15    32±7.8        28±7.2    20±5.2    24±6.6         1390±382
2012     26    10±2.8        9.7±3.2   9.7±3.2   14±3.9         699±176
2012     46    11±2.3        6.7±1.6   8.9±3.2   9.0±2.0        409±80

C. With 100 purified bone marrow cells

Lineage contribution (CD4, CD8, B220, Myeloid) is given as % Ly5.1 cells.

Cells    No. of mice  CD4       CD8       B220      Myeloid   Total % Ly5.1  CRU/10^5
Control  8            38±7.3    30±5.8    43±6.7    31±4.8    32±6.5         583±172
AFT024   7            20±9.8    18±8.8    20±8.7    24±11     22±10          490±296


A. AA4.1+ day 14 fetal liver cells (Ly5.2) were further purified for a lin-/lo, Sca-1+, c-kit+ stem cell surface
phenotype. Fresh purified control cells, 10^3, were transplanted with 10^6 Ly5.1 competitor marrow (n=6
mice). From the same purification, 10^4 cells were cocultured with stromal cell lines for 4 weeks.
Subsequently, 10% of each culture was transplanted per mouse (n=8 mice/stroma) together with 10^6
competitor BM cells. The contribution to each lineage in peripheral blood is expressed as the percent of the
total specific lineage population that was Ly5.2+. CRU/10^5, relative enrichment of competitive
repopulating units. Data are presented ±SEM.

B. The lineage and CRU content of low numbers of enriched fetal liver stem cells maintained on AFT024 
and 2012 were determined. 500 stem cells were maintained in Dexter-LTC for 4-7 weeks over irradiated 
monolayers. 20% of each culture was used to transplant groups of 4 mice (i.e. each mouse received the 
equivalent of 100 stem cells that initially seeded the cultures) combined with 4 x 10^5 competitor BM cells.
Data are presented as ±SEM. 

C. Ly5.1 BM cells with a lin-/lo, Sca-1+, and c-kit+ cell surface phenotype were purified. 100 fresh cells
per mouse (Control) were transplanted with 10^5 Ly5.2 competitor BM cells. One thousand of the same
purified cells were cocultured with stromal cell lines for 6 weeks. The cultures were then harvested and
10% of each culture was transplanted per mouse with competitor. Data are from peripheral blood samples
taken 4 months after transplant and are presented ±SEM.

Table 5. LTRSC maintained on AFT024 are able to repopulate secondary recipients at
levels comparable to non-cultured stem cells.

Values in the Radioprotection and Comp. Repopulation columns are % Ly5.2+ peripheral blood cells.

Group    Weeks  Radioprotection  Comp. Repopulation
Control  6      21 (1)           2.6±0.6 (4)
Control  22     54 (1)           5.6±1.6 (4)
AFT024   6      13±0.2 (4)       1.3±0.9 (8)
AFT024   22     44±15.1 (3)      4.3±0.9 (8)
2012     6      6.7±2.9 (4)      ND
2012     22     14±2.2 (4)       ND



The retransplantation potential of LTRSC in primary recipients of stromal cell cultured
stem cells was studied in secondary recipients. 60 weeks after transplant, primary mice
(see Figure 14, Table 4A) were sacrificed, and BM was harvested and stained with antibody to
Ly5.2. Ly5.2+ cells were collected by cell sorting and used to transplant secondary
recipients (congenic Ly5.1 mice). Control and AFT024 groups were transplanted with
1.5 x 10^6 Ly5.2 cells/mouse for radioprotection (4 mice/group were transplanted) and
7.5 x 10^5 Ly5.2 cells + 7.5 x 10^5 Ly5.1 cells for competitive repopulation (4 mice for the
Control group and 8 mice for the AFT024 group). 2012 mice were transplanted with
3 x 10^5 Ly5.2 cells/mouse (4 mice). Weeks are the times after transplant at which the mice
were analyzed; (n), number of mice surviving/group; ND, not done. Data are presented
±SEM.

EXAMPLE 4: Hematopoietic activity of a stromal cell transmembrane protein 
containing epidermal growth factor-like repeat motifs 

Primitive hematopoietic stem cells are closely associated with discrete in vivo
microenvironments. These "niches" are thought to provide the molecular signals that
mediate stem cell differentiation and self-renewal. The fetal liver microenvironment was
dissected into distinct cellular components by establishing an extensive panel of stromal cell
lines. One particular cell line maintains repopulating stem cells for prolonged in vitro
culture periods. A subtraction cloning strategy has yielded a cDNA which encodes a cell
surface glycoprotein with a restricted pattern of expression among stromal cell lines. This
molecule, previously identified as delta-like/pre-adipocyte factor-1, contains epidermal
growth factor-like repeats which are related to those in the notch/delta/serrate family of
proteins. The potential role of this molecule in hematopoietic stem/progenitor cell regulation
was investigated. The delta-like protein displays activity on purified stem cells by promoting
the formation of "cobblestone areas" of proliferation. These cobblestone areas contain both
primitive high-proliferative potential progenitors and in vivo repopulating stem cells.
Introduction 

The positive and negative regulatory mechanisms that govern the proliferation, self-renewal
and differentiation of primitive hematopoietic stem cells are complex and poorly
understood (Ogawa, M. (1993) Blood 81, 2844-2853). Numerous cytokines have been identified
that, when used in vitro, appear to act directly on purified stem cells by promoting
proliferation and differentiation. However, attempts to demonstrate the in vitro maintenance
and/or expansion of transplantable pluripotent stem cells using defined cytokine
combinations have been largely unsuccessful (Knobel, K. M., et al., 1994; Peters, S. O., et
al., 1995). Moreover, it is unclear to what extent any currently identified cytokines reflect
mechanisms that are responsible for regulating normal, in vivo, stem cell behaviors. It is
widely accepted that in vivo, stem cells are intimately associated with discrete
microenvironmental "niches" (Wolf, N. S. 1979). Such niches are likely sources for the
molecular signals which collectively mediate the differentiation and self-renewal of stem
cells. Indeed, it has long been possible to demonstrate that preestablished stromal cell
monolayers derived from hematopoietic tissues can support long-term hematopoiesis in vitro
(Dexter, T. M., Allen, T. D. & Lajtha, L. G. 1977). The long-term nature of these cultures,
together with the continuous production of committed progenitor cells, suggests that both
self-renewal and commitment decisions can occur in vitro. At the cellular level, the
hematopoietic microenvironment consists of numerous distinct cell types. Previous studies
have shown that this cellular heterogeneity reflects a similarly broad heterogeneity in terms
of hematopoietic supportive abilities (Deryugina, E. I., et al., 1994). Some cloned stromal
cell lines can support stem cell activity in vitro, while others are ineffective. Similarly,
distinct stromal cell types appear to influence the outcomes of stem cell differentiation
processes (Friedrich, C., et al., 1996). Recent studies have shown that stromal cell lines that
efficiently maintain long-term transplantable stem cells in vitro for prolonged intervals
represent a small fraction of the total stromal cell population (Wineman, J., et al., 1996). A
fetal liver stromal cell line, AFT024, was identified which maintains high levels of
transplantable multilineage stem cell activity for extended in vitro culture periods (Moore,
K. A., Ema, H. & Lemischka, I. R., 1997) (see Example 3). The stem cells used to initiate
these cultures are highly purified. It is, therefore, likely that in this system the mechanisms
that mediate stem cell maintenance do so by acting directly on primitive stem cells. A
different fetal liver-derived cell line, 2018, fails to maintain long-term repopulating stem cell
activity. A PCR-based RNA expression analysis of 13 cytokines reveals qualitatively
identical expression patterns in AFT024 and 2018. It was, therefore, hypothesized that the
hematopoietic supportive ability of AFT024 is, at least in part, mediated by novel gene
products not expressed in 2018.

Using a subtractive hybridization approach, a number of cDNA clones were identified
which were specifically expressed in AFT024 but not in 2018. The sequence of one cDNA
was identical to a molecule that encodes a transmembrane protein that contains six epidermal
growth factor (EGF) repeat motifs. The EGF-like repeat sequences of this molecule,
variously known as delta-like (dlk) (Laborda, J., et al., 1993), preadipocyte factor-1 (Smas,
C. M. & Sul, H. S., 1993), and stromal cell protein-1 (GenBank, D16847), are most closely
related to those present in the notch/delta/serrate family of signaling proteins. In Drosophila
and Caenorhabditis, these molecules are required for correct cell-fate specification decisions
in a variety of tissues (Rebay, I., et al., 1991). Vertebrate homologs of the
notch/delta/serrate family have been identified (Ellisen, L. W., et al., 1991; Bettenhausen, B.,
et al., 1995; Lindsell, C. E., et al., 1995). While the exact functional relationship of dlk to the
activities of this family of molecules is unclear, in one in vitro study, it has been shown to
block adipocyte differentiation (Smas, C. M. & Sul, H. S., 1993). No studies have been
reported that demonstrate a hematopoietic function for dlk. Expression analyses and these
observations show a limited temporal pattern of dlk expression during murine fetal
development which coincides with the time period of hematopoietic stem cell expansion
(Smas, C. M. & Sul, H. S., 1993).

Functional studies were undertaken to determine if dlk can act as a hematopoietic
regulator. This molecule affects highly enriched stem cell populations by promoting
"cobblestone area" (CSA) colony formation in Dexter-type stromal cocultures. These CSA
colonies contain an expanded population of primitive, high proliferative potential
myeloid-erythroid progenitors. These cultures also contain stem cells capable of in vivo
engraftment at levels equivalent to those present in parallel AFT024-supported cultures. It is
proposed that dlk represents one molecular component responsible for the hematopoietic
supportive ability of AFT024. As such, dlk may define a novel molecular pathway of stem cell
regulation by the microenvironment.

Materials and Methods 

Cell Lines and Culture. The fetal liver stromal cell lines used in this study were
derived as previously described (Wineman, J., et al., 1996). Cells were routinely cultured in
DMEM containing 10% fetal bovine serum (FBS) and 50 µM β-mercaptoethanol (2-ME),
and maintained at 31°-33°C, 5% CO2. For long-term cocultures with hematopoietic stem
cells, confluent monolayers were irradiated (20 Gy), placed in modified Dexter media
(DMEM, 10% FBS, 10% horse serum, 50 µM 2-ME, 0.1 µM hydrocortisone) and
maintained at 37°C, 5% CO2 with weekly media changes. NIH3T3 cells were obtained from
ATCC.

dlk Expression Analysis. Total RNAs from stromal cell lines were poly(A)+ selected,
Northern blotted, and hybridized to 32P-labeled probes according to standard protocols
(Sambrook, J., Fritsch, E. F. & Maniatis, T., 1989). A 600 bp dlk cDNA clone from the
AFT024 subtracted library was used as a probe. cDNA templates for RT-PCR were
prepared according to the manufacturer's protocols (GIBCO/BRL). Oligonucleotide primers
were: sense 5'-GACCCAGGCTGCCCC-3' (SEQ.ID.No.: 85) and antisense
5'-GGTACTCTTGTTGAG-3' (SEQ.ID.No.: 86). For analysis of dlk expression at the protein
level, antiserum specific for dlk was generated by immunizing rabbits with a Flag-dlk fusion
protein (described below). Resultant antibodies were purified by affinity chromatography.
Cell surface expression of dlk in stromal cell lines was assessed by flow cytometry.
Cells were incubated with dlk antibody or a similarly prepared irrelevant control antibody.
Specific labeling was developed with donkey anti-rabbit antibody conjugated to fluorescein
isothiocyanate (Jackson ImmunoResearch). Stained cells were analyzed on a Becton
Dickinson FACScan using Cell Quest software.
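
As a quick sanity check on the RT-PCR primers given above (SEQ.ID.No.: 85 and 86), the following minimal Python sketch computes each primer's GC content and a rough melting temperature. The Wallace rule (2 degrees C per A/T, 4 degrees C per G/C) is a generic rule of thumb for short oligonucleotides and is assumed here purely for illustration; it is not stated to be the method used in this study, and the function names are hypothetical.

```python
# Illustrative only: rule-of-thumb properties of the dlk RT-PCR primers.
# Function names are hypothetical; the Wallace rule is an assumption.

def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Approximate Tm in degrees C: 2 per A/T plus 4 per G/C."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Primer sequences as listed in the text.
PRIMERS = {
    "sense (SEQ.ID.No.: 85)": "GACCCAGGCTGCCCC",
    "antisense (SEQ.ID.No.: 86)": "GGTACTCTTGTTGAG",
}

if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: length={len(seq)}, "
              f"GC={gc_fraction(seq):.2f}, Tm~{wallace_tm(seq)} C")
```

Such estimates are only a first approximation; actual annealing conditions would need to be determined empirically.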

dlk Fusion Protein Preparation. The expression plasmid pCD4-Ig contains cDNA for
the extracellular domain of human CD4 fused to genomic sequences of the human
immunoglobulin heavy chain (Zettlmeissl, G., et al., 1990). cDNA for CH2-CH3 of human
IgG1 (Goodwin, R. G., et al., 1990) was cloned into the EcoRI and NotI sites of pcDNA3
(Invitrogen) to give the plasmid KB52.3.2. cDNA encoding the extracellular domain of dlk
was obtained by RT-PCR with primers BP151 and BP152 using total RNA from NIH3T3
cells as template. The resulting PCR fragment was cloned into KB52.3.2 via the HindIII and
EcoRI sites to obtain the soluble dlk-Ig expression plasmid. pdlk-Ig or pCD4-Ig was
transfected into NIH3T3 cells together with pSVNeo and stable clones were isolated.
Soluble CD4-Ig and dlk-Ig fusion proteins were harvested and then purified by affinity
chromatography on HiTrap Protein G-Sepharose (Pharmacia). Primers: sense BP151,
5'-GAGGGTACCAAGCTTCGTGGTCCGCAACCAGAAG-3' (SEQ.ID.No.: 87);
antisense BP152, 5'-CTCAGATCTGAATTCGGCCTGTCCCTCGGTGAGGAG-3'
(SEQ.ID.No.: 88).
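
The cloning step above relies on restriction sites carried in the 5' tails of the primers. The short Python sketch below scans primers BP151 and BP152 for a few standard recognition sequences (HindIII AAGCTT, EcoRI GAATTC, KpnI GGTACC, BglII AGATCT) and reports where each occurs. The helper name `find_sites` and the choice of enzymes to scan for are illustrative assumptions; because these recognition sequences are palindromic, scanning one strand is sufficient.

```python
# Illustrative only: locate restriction sites encoded in cloning primers.
# The helper name and the enzyme panel are assumptions for this sketch.

SITES = {
    "HindIII": "AAGCTT",
    "EcoRI": "GAATTC",
    "KpnI": "GGTACC",
    "BglII": "AGATCT",
}

# Primer sequences as listed in the text (5' to 3').
BP151 = "GAGGGTACCAAGCTTCGTGGTCCGCAACCAGAAG"
BP152 = "CTCAGATCTGAATTCGGCCTGTCCCTCGGTGAGGAG"


def find_sites(primer: str, sites: dict = SITES) -> dict:
    """Map enzyme name to the 0-based index of its first site in primer."""
    primer = primer.upper()
    return {
        name: primer.find(motif)
        for name, motif in sites.items()
        if motif in primer
    }


if __name__ == "__main__":
    for label, primer in (("BP151", BP151), ("BP152", BP152)):
        print(label, find_sites(primer))
```

Under these assumptions the sense primer carries the HindIII site and the antisense primer the EcoRI site, consistent with the HindIII/EcoRI cloning described in the text.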

Flag-dlk fusion protein was used to immunize rabbits for the production of dlk
antiserum. The protein expression plasmid pcDNA3-Flag is a modification of the plasmid
pcDNA3 (Invitrogen) and contains the coding region for the Flag peptide (DYKDDDDKI)
(Hopp, T. P., et al., 1988) as well as a BglII restriction site. A cDNA fragment encoding the
extracellular domain of dlk was obtained by RT-PCR using RNA from NIH3T3 cells.
Primers: sense BP155,
5'-GACAAGATCTCAGCTGAATAGCGACCCACCCTGTG-3' (SEQ.ID.No.: 89);
antisense BP154,
5'-GCATCTAGAGCGGCCGCTCAGGCCTGTCCCTCGGTGAGGAG-3' (SEQ.ID.No.: 90).
The PCR fragment was ligated into pcDNA3-Flag to yield pFlag-dlk. pFlag-dlk was
transfected into COS cells. Purification of the Flag-dlk protein from COS-conditioned media
was performed according to the manufacturer's directions using the Flag monoclonal antibody,
M1, immobilized on agarose (International Biotechnologies).

Plasmid Constructs and Stable Transfection. Full-length murine dlk cDNA was
obtained by RT-PCR with primers BP151 (see above) and antisense BP200:
S'GCATCTAGAGCGGCCGCGAACGCTGCTTAGATCTCCTO' (SEQ.ID.No.: 91), using
total RNA from NIH3T3 cells as template. The product was subcloned into the vector
pCRII (Invitrogen) and then cloned into a retroviral expression vector (Kitamura, T., et al.,
1995) (G. Nolan, Stanford University) via the primer-encoded HindIII and NotI sites.
Supercoiled plasmid was transfected into BFC012 stromal cells together with the pZeo
(Invitrogen) selectable marker and selected in 50 µg/ml Zeocin (Invitrogen). BFC012 cells
also were transfected with pZeo alone and selected as above. Clones from both selected
populations were isolated and all remaining colonies (100-200 per dish) were pooled and
expanded as populations.

Hematopoietic Stem Cells and In Vitro Hematopoietic Assays. Hematopoietic stem
cell populations were derived from wild type, Ly5.2-C57BL/6J (Jackson Laboratories), day
14 fetal liver, enriched for the AA4.1+, Sca-1+, c-kit+, and lin(lo/-) phenotype by
immunopanning and fluorescence-activated cell sorting as described (Jordan, C. T., et al.,
1995). Adult bone marrow (BM) was used directly after density centrifugation and
immunomagnetic bead depletion or was further enriched for Sca-1+, c-kit+, lin(lo/-) cells by
flow cytometry as described (Okada, S., et al., 1992). Cell sorting and data analysis were
accomplished with a Becton Dickinson FACS Vantage using Cell Quest software. Stromal
cell/stem cell cocultures were initiated in 12-well trays with 300-1,000 enriched stem cells
per well. Cobblestone areas were quantitated by inverted-phase microscopy as described
(Ploemacher, R. E., et al., 1991). Clonogenic progenitor assays were performed with either
freshly purified stem cells or cells harvested from the stromal cocultures. These were
cultured in cytokine-containing semisolid media according to the manufacturer's
recommendations (Stem Cell Technologies, Vancouver, BC). Soluble dlk and control fusion
proteins were added to semisolid progenitor assays at concentrations of 0.1, 0.5 and 1.0
µg/ml and also to BFC012 stromal cocultures at a concentration of 0.1 µg/ml. Fusion protein
was replenished weekly in the stromal cocultures.

Competitive repopulating transplantation assay. Cultured cells were harvested,
combined with fresh unfractionated BM obtained from congenic C57BL/6 Ly5.1 mice
(National Cancer Institute) and transplanted into lethally irradiated (10 Gy, split dose 3 h
apart from a 137Cs source, 1 Gy/min) Ly5.1 recipient mice. Each mouse received 2 x 10^5
competitor BM cells and a fraction of the cocultured stem cells. Mice were bled by capillary
puncture of the orbital venous plexus and 100 µl was collected; red blood cells were
removed by NH4Cl lysis. The nucleated cells were stained for the Ly5.2 (CD45.2) allelic
marker using either fluorescein isothiocyanate-labeled directly conjugated Ly5.2 monoclonal
antibody or a biotinylated form developed with streptavidin conjugated to Texas red. Cells
also were stained with directly conjugated antibodies to lineage markers. All antibodies and
chromogens were obtained from Pharmingen. Flow cytometric analysis was done on a
Becton Dickinson FACS Vantage using Cell Quest software.
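
The readout of the competitive repopulation assay described above is the proportion of nucleated blood cells carrying the donor (Ly5.2) marker. A minimal Python sketch of that arithmetic follows; the function name and the event counts are hypothetical and are not data from this study, and any gating or background correction applied in practice is not modeled.

```python
# Illustrative only: percent donor-derived (Ly5.2+) cells in a blood sample.
# The function name and the example counts are hypothetical.

def percent_donor(ly52_positive: int, total_nucleated: int) -> float:
    """Percentage of nucleated cells that stain positive for Ly5.2."""
    if total_nucleated <= 0:
        raise ValueError("total_nucleated must be positive")
    if not 0 <= ly52_positive <= total_nucleated:
        raise ValueError("ly52_positive must lie between 0 and total_nucleated")
    return 100.0 * ly52_positive / total_nucleated


if __name__ == "__main__":
    # Hypothetical flow cytometry event counts for one recipient mouse.
    print(f"{percent_donor(150, 10_000):.2f}% Ly5.2+ cells")
```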

Results 

Genes expressed in AFT024 but not in 2018 were identified by a subtractive cloning
approach. Sequence analysis identified one of these AFT024-specific clones as dlk.
Expression studies (Figure 19A) show high levels of dlk in AFT024 and subclones isolated
from this line, but undetectable levels in 2018 and BFC012. The latter two stromal cell lines
do not support repopulating stem cells. The PA6 stromal cell line and NIH Swiss 3T3 cells
both show expression of dlk and were the cell sources for identification of SCP-1 and dlk,
respectively. PA6 cells have been shown to support in vitro hematopoiesis and long-term, in
vivo repopulating stem cells (Kodama, H., et al., 1992). Swiss 3T3 cells are also capable of
supporting multipotent hematopoietic stem cells in vitro, promoting CSAs and maintaining
in vivo spleen colony-forming units (CFU-S) (Roberts, R. A., et al., 1987). Interestingly, an
additional fetal liver stromal cell line, 2012, which has some degree of stem cell supporting
activity (Wineman, J., et al., 1996), and its subclones also express dlk. Furthermore, an
RT-PCR analysis (40 cycles) of an additional 10 fetal liver-derived stromal cell lines and
several other lines shows detectable levels of dlk in only two additional lines (Figure 19B).
These two cell lines (CFC032 and CFC008) can maintain some level of the long-term
transplantable stem cell activity present in whole BM (Wineman, J., et al., 1996). These
results suggested a correlation between a stromal cell line's ability to support stem cells and
the expression of dlk. Therefore, functional studies were undertaken to determine whether
dlk can act on or modulate hematopoietic stem cells.

Soluble dlk protein was added to progenitor cultures in semisolid media. The soluble
protein consisted of the dlk extracellular domain fused to the Fc portion of human IgG1. The
stem cell sources in these assays were highly enriched fetal liver cells (AA4.1+, lin(lo/-),
Sca-1+, c-kit+). The influence of soluble dlk on hematopoietic progenitor colony formation
was assessed. As shown in Table 6, no differences were noted in the number, sizes, or
lineage compositions of colonies. Identical results were obtained at dlk concentrations
ranging from 0.1 to 1.0 µg/ml. In addition, no differences were noted in similar studies
using enriched BM cells (Sca-1+, c-kit+, lin(lo/-)).

Evidence for a positive effect of the dlk protein on stem/progenitor cells was
observed when the soluble form was added to Dexter-type cocultures. For these studies a
stromal cell line (BFC012) was used that neither expresses endogenous dlk (see Figure 19)
nor maintains significant in vitro hematopoiesis. In four experiments, two each using highly
enriched adult BM and fetal liver stem cells, we monitored the appearance of CSAs over
time. These colonies provide a convenient, quantitative estimate of hematopoietic activity
initiated by primitive stromal-dependent stem/progenitor cells. As shown in Figure 20, the
addition of soluble dlk (0.1 µg/ml) results in an approximately 2-fold increase in the number
of CSAs initiated by purified fetal liver or BM stem cells over a 2-week time period (P =
0.001 for dlk vs. control and P = 0.01 for dlk vs. no additive, Student's t-test). There was no
difference in the numbers of CSAs observed in BFC012/stem cell cultures with or without
control fusion protein (mean of no additive/control = 0.96 ± 0.11).
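
The comparison reported above combines a fold increase in CSA numbers with a two-sample Student's t-test. The Python sketch below shows how both quantities can be computed using only the standard library. The CSA counts are hypothetical placeholders, not data from this study, and the sketch assumes the equal-variance (pooled) form of the test; it returns the t statistic and degrees of freedom and leaves conversion to a P value to a statistics package.

```python
# Illustrative only: fold change and pooled two-sample Student's t statistic.
# The CSA counts below are hypothetical, not measurements from the study.
from math import sqrt
from statistics import mean, variance


def fold_change(treated, control):
    """Ratio of the mean of treated wells to the mean of control wells."""
    return mean(treated) / mean(control)


def student_t(a, b):
    """Return (t statistic, degrees of freedom) assuming equal variances."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df


if __name__ == "__main__":
    csa_dlk = [20, 22, 18, 24]       # hypothetical CSAs per well, soluble dlk
    csa_control = [10, 11, 9, 12]    # hypothetical CSAs per well, control protein
    t, df = student_t(csa_dlk, csa_control)
    print(f"fold change = {fold_change(csa_dlk, csa_control):.2f}")
    print(f"t = {t:.3f} with {df} degrees of freedom")
```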

In order to assess the activity of the normal transmembrane form of dlk, a full-length
dlk cDNA was transfected into BFC012 cells. Expression of the introduced dlk was
demonstrated at the RNA level (Northern blot) and at the protein level, using both Western
blot and flow cytometric analyses with rabbit anti-dlk antibodies. The flow cytometry data
are presented in Figure 21.

Initially, dlk-expressing transfected populations (BFC-dlk) were compared to a
negative control "mock" transfected population of BFC012 cells. A 4- to 6-fold increase in
the number of CSAs was observed in two separate experiments. The maintenance of CSAs
was transient, lasting less than 2 weeks. No further hematopoietic activity was observed
during an additional 2 weeks of culture. Once dlk-expressing clones had been identified
from the transfected populations, they were studied for their ability to support CSAs in
experiments designed to more precisely identify the time course of hematopoietic activity.
Ficoll-separated, lineage-depleted BM was used in these experiments. Five different
negative control, non-dlk-expressing BFC012 cell groups (parental BFC012 cells, two
"mock" transfected populations, and two "mock" transfected clones) and 3 dlk-expressing
BFC012 cell groups (one transfected population and two clones) were studied. The data are
presented in Figure 22A. Neither the negative control BFC populations nor the "mock"
transfected BFC clones supported high numbers of CSAs. In contrast, the BFC-dlk
populations and the two individual dlk-expressing clones supported significantly greater
numbers of CSAs at all time points studied (P < 0.001, days 3, 4, and 5; P < 0.01, days 6 and 7;
Student's t-test). As observed previously, all the CSAs were transient. This experiment
also indicated that the dlk-promoted hematopoietic activity peaks early, at 4 days, in this
culture system. Three additional experiments using purified (AA4.1+, lin(lo/-), Sca-1+, c-kit+)
fetal liver stem cells were performed using two individual clones, BFC-dlk-5 and a "mock"
transfected negative control, BFC-Zeo-1. The results are presented in Figure 22B. There was
a dramatic and significant difference in the number of CSAs observed in the BFC-dlk-5
cultures compared to the control line (P < 0.001, days 4, 6, and 8, Student's t-test). As
before, the effect was transient and the CSAs declined in number over 2 weeks. AFT024 was
included as a positive control and, in each of the three experiments, verified the quality of
the input purified stem cells. In the first week of culture the numbers of CSAs observed on
AFT024 were similar to the numbers in the BFC-dlk-5 cultures.

In order to address the "primitiveness" of the cells that give rise to the CSAs observed
in the BFC-dlk cocultures, a series of in vitro replating experiments was performed.
Individual wells were harvested at various time points of coculture and the cells were plated
into semisolid cytokine-containing media. The numbers and lineage compositions of the
colonies were scored after 8-12 days. As shown in Figure 23A, the CSAs obtained from day
4 BFC-dlk-5 cocultures contained numerous progenitors capable of extensive proliferation
and multilineage differentiation. The total number of progenitors from the dlk-expressing
cultures at day 4 was significantly expanded compared to the content in the freshly purified
uncultured stem cell population (P = 0.01, Student's t-test). The number and lineage
composition of colonies derived from parallel day 4 AFT024 cultures were nearly identical to
those of BFC-dlk-5-derived colonies. The content of CSAs replated at day 6 from the BFC-dlk-5
cocultures was devoid of multilineage colonies, although CFU-granulocyte-macrophages
were maintained at high levels; the progenitor content in the BFC-dlk-5 cocultures continued
to decrease when next sampled at 10 days. In contrast, few progenitors could be
demonstrated in the BFC-Zeo-1 cultures (P = 0.001, BFC-dlk-5 vs. BFC-Zeo-1, Student's
t-test) (Figure 23A). Taken together, the data strongly suggest that dlk acts to promote
stromal-dependent colony formation by primitive cells capable of yielding large numbers of
committed progenitors, including those endowed with a high proliferative capacity and
multilineage differentiation potential. The lack of CSAs and significant progenitor
maintenance in the BFC-Zeo-1 cultures argues that expression of dlk in the transfected
BFC012 cells is responsible for both their ability to support CSAs and to generate/maintain
primitive progenitors.

In order to determine if the CSA-containing cultures supported by BFC-dlk-5
contained stem cells capable of in vivo engraftment, portions of the same day 4 cocultures
that were plated into progenitor assays also were used to transplant mice in competitive
repopulation assays. Shown in Figure 23B are the results from two independent experiments
analyzed at 10 weeks after transplant. The same BFC-dlk-5 cultures that contain CSAs and
primitive CFU-high proliferative potential (HPP)-Mix progenitors also contain repopulating
stem cells at levels equal to those maintained in parallel AFT024 cocultures. In addition, a
significant difference exists in the levels of repopulating stem cells derived from
dlk-expressing cocultures compared to non-dlk-expressing BFC012 cells (P = 0.05, Student's
t-test). Multi-color flow cytometric analyses also demonstrated that both myeloid and
lymphoid Ly5.2 cells are present in these animals. A subsequent analysis of these animals at
22 weeks demonstrated lower levels of repopulation with Ly5.2 cells derived from the
AFT024 and BFC-dlk-5 supported cultures (data not shown). Most significantly, no
repopulation was observed at any time point in mice that received cells cocultured on the
non-dlk-expressing monolayers (Figure 23B).

Discussion

As part of an ongoing effort to understand the biology of the hematopoietic
microenvironment, a panel of stromal cell lines from midgestation fetal liver was established
and characterized. Among these cell lines, the AFT024 line has the ability to maintain
nearly quantitative levels of transplantable stem cell activity for extended in vitro time
periods (Moore, K. A., Ema, H. & Lemischka, I. R., 1997). Because these cultures are
initiated with highly purified stem cell populations, it is likely that the AFT024-derived
molecular mechanisms responsible for this ability act directly on the stem cell population.
Other stromal lines that fail to maintain stem cell activity were also identified. These
observations facilitated a subtractive hybridization approach aimed at identifying potential
candidate molecules whose collective actions may be responsible for the AFT024 stem cell
maintenance activity. This effort has identified dlk, a transmembrane molecule containing
six EGF-like repeat motifs. Although lacking the DSL motif indicative of the notch ligands
delta and serrate (Tax, F. E., Yeargers, J. J. & Thomas, J. H., 1994), dlk is most closely
homologous to delta/notch/serrate when compared to other EGF-like repeat-containing
molecules (Laborda, J., et al., 1993; Smas, C. M. & Sul, H. S., 1993). The predominant role
of these types of molecules in cell growth and differentiation led us to investigate the
potential role of dlk in hematopoiesis. Constitutive expression of translocated human notch
(Tan-1) is found in a T-cell leukemia (Ellisen, L. W., et al., 1991). Moreover, the expression
of Tan-1 in primitive human stem cells has been demonstrated (Milner, L. A., et al., 1994).
Nevertheless, a functional role in hematopoiesis for the notch ligands Jagged (Lindsell, C.
E., et al., 1995) and Delta-like-1 (Bettenhausen, B., et al., 1995) has not been described.
Expression of dlk is highly restricted in a panel of stromal cell lines. Two lines, AFT024 and
2012, which maintain repopulating stem cell activity in vitro, express dlk, whereas two
nonsupportive cell lines, 2018 and BFC012, do not. Interestingly, the S17 stromal cell line,
which is considered to be a potent stem cell supporter (Wineman, J. P., et al., 1993), does not
express detectable levels of dlk. The S17 cell line was derived from adult BM (Collins, L. S.
& Dorshkind, K., 1987). The other lines described are all derived from fetal sources
(AFT024, 2012, and NIH 3T3 cells) or from newborn calvaria (PA6 cells). It is therefore
possible that dlk acts in a developmentally regulated fashion. An extensive analysis of dlk
expression in adult BM stroma is currently underway. Taken together, the data suggest that,
at least in fetal stromal cell types, there exists a correlation between hematopoietic
supporting ability and the expression of dlk.
The potential activities of both soluble and transmembrane dlk protein on
highly purified stem cell populations were directly measured using in vitro and in vivo
assays. Initial experiments designed to ask if dlk can enhance colony formation in
cytokine-rich semisolid assay systems were negative. These results may indicate that: (i)
progenitor cells capable of colony formation in semisolid assays do not respond to dlk, (ii) the
collection of cytokines present in the semisolid cultures may "mask" an effect(s) of added
dlk, or (iii) the soluble form of dlk requires a stromal monolayer to mediate its effects.
The first possibility can be addressed more extensively in delta-type assays, where
stem/progenitor cells are first cultured in suspension in serum-free media containing various
cytokine combinations, with and without dlk, and then replated into colony assays (Muench,
J. O., Firpo, M. T. & Moore, M. A., 1993). The second possibility can be addressed by more
extensive studies using subsets of the cytokines present in our initial studies. These
experiments are underway. As a first step to address the third possibility, we added soluble
dlk to preestablished BFC012 monolayers. Using both purified BM and fetal liver stem cell
populations, a significant increase in CSA colony formation was observed in the
dlk-supplemented cultures (Figure 20). This was a surprising result, given that dlk is a
transmembrane protein; however, before its cDNA cloning, a soluble form of dlk was
identified as FA1 or fetal antigen 1 (Jensen, C. H., et al., 1994). A role in hematopoiesis
was not indicated in these studies, but expression was detected in stroma of placental villi, in
yolk sac blood islands and in fetal liver (Jensen, C. H., et al., 1994). It is of interest to
determine if a soluble form is produced by the stromal cell lines that express dlk. An
additional explanation for the effects observed with the soluble form added to stromal/stem
cell cocultures is that they may be facilitated by the Fc portion of the fusion protein. It is
possible that Fc receptors expressed by some of the hematopoietic cells in the cultures are
able to sequester and present the dlk-Fc fusion protein more effectively. This possibility can
be addressed by using a different type of soluble dlk protein. These studies have been
initiated. Alternatively, the soluble dlk may be sequestered and thus presented by the
stromal cell extracellular matrix. In order to further address the third possibility, an intact
transmembrane form of dlk was introduced into the BFC012 stromal cell line. Initially,
dlk-transfected BFC012 cell populations were compared to BFC012 cells transfected with the
selectable marker alone. In these studies, the dlk-expressing monolayers were more effective
at promoting CSA colonies. As with the soluble dlk experiments, the CSAs appeared early
and were transient. When cells were replated from the BFC-dlk-supported cocultures onto
an AFT024 monolayer, a reiteration of the burst of CSAs was seen that was maintained for 3
weeks. In addition, replating of these cocultures revealed a high progenitor content (~1 in
10) that included a high percentage (43%) of multipotential colonies including HPP-Mix. In
the BFC-Zeo control populations neither replatable CSA nor CFU progenitors were
maintained. Further experiments with individual clones from the transfected populations
confirmed and extended the results obtained with the populations, demonstrating highly
significant differences in the numbers of developing CSAs (Figure 22). However, in two
experiments we observed that one dlk-transfected BFC012 clone, which expresses a very
high level of dlk, supported fewer CSAs than non-dlk-expressing control cells. These
cultures also suggested differentiation phenomena, as indicated by the number of rapidly
accumulating nonadherent cells. Experiments utilizing this cell line were not included in our
analyses. It is possible that there may be a threshold level of dlk expression necessary in
these cultures and when it is surpassed the cells differentiate and die rapidly in the culture
milieu provided by BFC012 cells. In addition, it is possible that an aberrant form of the dlk
protein is made by this line. Further studies are necessary to clarify this issue. Nevertheless,
in Figure 22A, dlk-transfected BFC012 cells (one population and two clones) show a
significant enhancement of CSA formation compared to controls.

The observed low-level maintenance of competitive repopulating stem cells in
short-term dlk-expressing cocultures is of interest even though the activity diminished over
time. These studies show that the ectopic expression of a single molecule (dlk) in a previously
nonsupporting stromal cell line restores or enables hematopoietic support. This is
demonstrated by maintenance of three different stem/progenitor cell compartments: (i)
CFU-HPP-Mix, (ii) CSA, and (iii) short-term in vivo repopulating stem cells. It is also of
interest that both qualitatively and quantitatively similar stem/progenitor cell compartments
are maintained in short-term AFT024-supported cocultures (Moore, K. A., Ema, H. &
Lemischka, I. R., 1997).

Two mechanisms underlying the effects of dlk are considered. First, it may be that
some level of dlk expression is sufficient to retard potent differentiation signals provided by
the BFC012 cell line. Second, dlk may provide a proliferative stimulus not normally
produced by BFC012. A more direct and perhaps more relevant assay will be to eliminate the
expression of dlk in AFT024 cocultures, thus maintaining other components that make up
the culture "milieu." Studies to evaluate potential neutralizing antibodies and various dlk
antisense strategies are underway.

The failure of BFC-dlk-5-supported CSAs to persist for periods longer than 1 to 2
weeks also may suggest the existence of other molecules in AFT024 that facilitate
hematopoiesis. In this regard, it is interesting that our subtraction screen has yielded
several other clones with expression patterns very similar to dlk. Eventually, with the
addition of dlk and other AFT024-specific molecules it may be possible to reconstruct a
supportive phenotype. This should lead towards a further understanding of the in vivo
hematopoietic microenvironment. In summary, it is proposed that dlk represents one
molecular component responsible for the hematopoietic supportive activities of the
AFT024 cell line. As such, dlk may define a novel molecular pathway of stem cell
regulation by the hematopoietic microenvironment.

WHAT IS CLAIMED IS:

1. An isolated nucleic acid derived from an isolated hematopoietic stem cell, the isolated
nucleic acid comprising the following characteristics:

- specifically expressed in the hematopoietic stem cell; and

- encoding a hematopoietic stem cell-specific protein.

2. An isolated nucleic acid of claim 1, the isolated nucleic acid further comprising the 
following characteristic: 

- capable of hybridizing under standard conditions with a sequence selected from 
the group consisting of SEQ.ID.No.: 1, SEQ.ID.No.: 2, SEQ.ID.No.: 3, 
SEQ.ID.No.: 4, SEQ.ID.No.: 5, SEQ.ID.No.: 6, SEQ.ID.No.: 7, SEQ.ID.No.: 8, 
SEQ.ID.No.: 9, SEQ.ID.No.: 10, SEQ.ID.No.: 11, SEQ.ID.No.: 12, SEQ.ID.No.: 
13, SEQ.ID.No.: 14, SEQ.ID.No.: 15, SEQ.ID.No.: 16, SEQ.ID.No.: 17, 
SEQ.ID.No.: 18, SEQ.ID.No.: 19, SEQ.ID.No.: 20, SEQ.ID.No.: 21, SEQ.ID.No.: 
22, SEQ.ID.No.: 23, SEQ.ID.No.: 24, SEQ.ID.No.: 25, SEQ.ID.No.: 26, 
SEQ.ID.No.: 27, SEQ.ID.No.: 28, SEQ.ID.No.: 29, SEQ.ID.No.: 30, SEQ.ID.No.: 
31, SEQ.ID.No.: 32, SEQ.ID.No.: 33, SEQ.ID.No.: 34, SEQ.ID.No.: 35, 
SEQ.ID.No.: 36, SEQ.ID.No.: 37, SEQ.ID.No.: 38, SEQ.ID.No.: 39, SEQ.ID.No.: 
40, SEQ.ID.No.: 41, SEQ.ID.No.: 42, and SEQ.ID.No.: 43, SEQ.ID.No.: 45, 
SEQ.ID.No.: 47, SEQ.ID.No.: 49, SEQ.ID.No.: 51, SEQ.ID.No.: 53, SEQ.ID.No.: 
55, SEQ.ID.No.: 57, SEQ.ID.No.: 59, SEQ.ID.No.: 61, SEQ.ID.No.: 63, 
SEQ.ID.No.: 65, SEQ.ID.No.: 67, SEQ.ID.No.: 72 or a portion thereof.

3. The isolated nucleic acid of claim 1, the isolated nucleic acid further comprising the 
characteristic of encoding a protein capable of modulating hematopoietic stem cell 
activity. 

4. The isolated nucleic acid of claim 3, wherein the activity is selected from the group
consisting of hematopoietic stem cell differentiation and hematopoietic stem cell
replication.

5. The isolated nucleic acid of claim 3, wherein the protein is selected from the group
consisting of a growth factor, a transcription factor, a splicing factor, a capping factor,
a transport protein, a translation factor, and a replication factor.

6. The isolated nucleic acid of claim 1, wherein the nucleic acid comprises the
nucleotide sequence of SEQ.ID.No.: 72, an analog thereof, or a portion thereof.

7. The isolated nucleic acid of claim 1, wherein the hematopoietic stem cell is a
primitive hematopoietic stem cell.

8. The isolated nucleic acid of claim 7, wherein the primitive hematopoietic stem cell is
selected from the group consisting of an umbilical cord cell, a bone marrow cell and
a fetal liver cell.

9. The isolated nucleic acid of claim 7, wherein the primitive hematopoietic stem cell is
selected from the group consisting of an AFT024 cell, a 2012 cell and a 2018 cell.

10. A composition comprising the nucleic acid of claim 1, wherein the nucleic acid
comprises one selected from the group consisting of SEQ.ID.No.: 1, SEQ.ID.No.: 2,
SEQ.ID.No.: 3, SEQ.ID.No.: 4, SEQ.ID.No.: 5, SEQ.ID.No.: 6, SEQ.ID.No.: 7,
SEQ.ID.No.: 8, SEQ.ID.No.: 9, SEQ.ID.No.: 10, SEQ.ID.No.: 11, SEQ.ID.No.: 12,
SEQ.ID.No.: 13, SEQ.ID.No.: 14, SEQ.ID.No.: 15, SEQ.ID.No.: 16, SEQ.ID.No.: 17,
SEQ.ID.No.: 18, SEQ.ID.No.: 19, SEQ.ID.No.: 20, SEQ.ID.No.: 21, SEQ.ID.No.: 22,
SEQ.ID.No.: 23, SEQ.ID.No.: 24, SEQ.ID.No.: 25, SEQ.ID.No.: 26, SEQ.ID.No.: 27,
SEQ.ID.No.: 28, SEQ.ID.No.: 29, SEQ.ID.No.: 30, SEQ.ID.No.: 31, SEQ.ID.No.: 32,
SEQ.ID.No.: 33, SEQ.ID.No.: 34, SEQ.ID.No.: 35, SEQ.ID.No.: 36, SEQ.ID.No.: 37,
SEQ.ID.No.: 38, SEQ.ID.No.: 39, SEQ.ID.No.: 40, SEQ.ID.No.: 41, SEQ.ID.No.: 42,
SEQ.ID.No.: 43, SEQ.ID.No.: 45, SEQ.ID.No.: 47, SEQ.ID.No.: 49, SEQ.ID.No.: 51,
SEQ.ID.No.: 53, SEQ.ID.No.: 55, SEQ.ID.No.: 57, SEQ.ID.No.: 59, SEQ.ID.No.: 61,
SEQ.ID.No.: 63, SEQ.ID.No.: 65, SEQ.ID.No.: 67, SEQ.ID.No.: 72 or a portion
thereof.

11. The nucleic acid of claim 1, wherein the nucleic acid is selected from the group
consisting of DNA, RNA and cDNA. 

12. A vector comprising the nucleic acid of claim 1. 

13. The vector of claim 12, wherein the vector comprises viral or plasmid DNA. 

14. An expression vector comprising the nucleic acid of claim 1 and a regulatory element. 

15. A host vector system which comprises the expression vector of claim 12 in a suitable
host. 

16. The host vector system of claim 15, wherein the suitable host is selected from the 
group consisting of a bacterial cell, a eukaryotic cell, a mammalian cell and an insect
cell.

17. An isolated hematopoietic stem cell specific protein or a portion thereof encoded by 
the nucleic acid of claim 1.

18. The isolated hematopoietic stem cell of claim 16, further comprising the following
characteristic: 

- capable of modulating hematopoietic stem cell activity. 

19. The isolated hematopoietic stem cell of claim 18, wherein the activity is selected from
the group consisting of hematopoietic stem cell differentiation and hematopoietic
stem cell replication.

20. The protein of claim 17, wherein the protein is selected from the group consisting of a 
growth factor, a transcription factor, a splicing factor, a capping factor, a transport
protein, a translation factor, and a replication factor.

21. The protein of claim 17, the protein having substantially the same amino acid 
sequence as one selected from the group consisting of SEQ.ID.No.: 42, SEQ.ID.No.: 
44, SEQ.ID.No.: 46, SEQ.ID.No.: 48, SEQ.ID.No.: 50, SEQ.ID.No.: 52, SEQ.ID.No.: 
54, SEQ.ID.No.: 56, SEQ.ID.No.: 58, SEQ.ID.No.: 60, SEQ.ID.No.: 62, SEQ.ID.No.: 
64, SEQ.ID.No.: 66, SEQ.ID.No.: 68, SEQ.ID.No.: 70, SEQ.ID.No.: 71, and 
SEQ.ID.No.: 73. 

22. A nucleic acid probe capable of specifically hybridizing with the nucleic acid of 
claim 1 under standard conditions. 

23. An antibody capable of specifically binding to the protein of claim 17 without
substantially cross-reacting with a non-stem cell specific protein or homologs thereof
under conditions permissive to antibody binding.

24. A cell capable of producing the antibody of claim 23. 

25. A method for identifying the presence of a primitive hemopoietic stem cell in a
sample comprising: 

(a) contacting the sample with the antibody of claim 23 under conditions permissive 
to the formation of an antibody complex; and 

(b) detecting the presence of the complex formed in step (a), the presence of a
complex formed indicating the presence of a primitive hemopoietic stem cell in
the sample.

26. The method of claim 25, wherein the antibody is labeled with a detectable marker. 

27. The method of claim 26, wherein the detectable marker is selected from the group
consisting of a radioactive isotope, enzyme, magnetic bead, dye, fluorescent marker
and biotin. 

28. A method for generating a stem cell/progenitor cell from a primitive hematopoietic 
cell in a sample comprising contacting the sample with the protein of claim 17.

29. A method for identifying the presence in a sample of a compound that modulates
hematopoietic stem cell activity comprising: 

(a) contacting the hematopoietic stem cell with the sample; 

(b) determining the hematopoietic stem cell activity; 

(c) comparing the hematopoietic stem cell activity determined in step (b) with the 
activity determined in the absence of the compound, an increase or decrease in
hematopoietic stem cell activity indicating the presence in the sample of a 
compound that modulates hematopoietic stem cell activity. 

30. The method of claim 29, wherein the activity is selected from the group consisting of 
gene expression, replication, differentiation, transplantation, and self regeneration. 

31. A compound identified by the method of claim 29, previously unknown. 

32. A method for identifying primitive hematopoietic stem cell-specific nucleic acids,
comprising: 

(a) creating a primitive hematopoietic stem cell cDNA library and a non-primitive 
stem cell immune cell cDNA library; and 

(b) subtracting the two libraries, thereby identifying primitive stem cell specific 
nucleic acids. 

33. The method of claim 32, step (b) comprising: 

(i) contacting the nucleic acids of the stem cell and non-stem cell libraries with 
each other under conditions permissive to hybridization, thereby forming 
hybrid complexes; 

(ii) separating the hybrid complexes formed in step (b) from the nucleic acids 
which did not form complexes; 

(iii) isolating the nucleic acids which did not form complexes, thereby 
identifying hematopoietic stem cell specific nucleic acids. 

34. The method of claim 33, step (ii) further comprising amplification of the nucleic
acids. 

35. The method of claim 33, step (iii) further comprising amplification of the nucleic acids
which did not form complexes. 

36. The method of claim 33, further comprising displaying the amplified DNA on a 
chromatography gel.

37. The method of claim 32, step (b) comprising differential display of the two libraries, 
thereby identifying primitive stem cell specific nucleic acids. 

38. The method of claim 32, step (b) comprising representation difference analysis of the
two libraries, thereby identifying primitive stem cell specific nucleic acids. 

39. The method of claim 32 further comprising cloning the stem cell specific nucleic acid. 

40. The method of claim 32, wherein the stem cell is selected from the group consisting
of AFT024, 2012, and 2018.

41. A nucleic acid identified by the method of claim 32.

42. A composition comprising the compound of claim 31 and a carrier.

43. A pharmaceutical composition comprising the compound of claim 31 and a 
pharmaceutically acceptable carrier.

44. A primitive hematopoietic stem cell specifically expressing one selected from the
group consisting of: SEQ.ID.No.: 1, SEQ.ID.No.: 2, SEQ.ID.No.: 3, SEQ.ID.No.: 4,
SEQ.ID.No.: 5, SEQ.ID.No.: 6, SEQ.ID.No.: 7, SEQ.ID.No.: 8, SEQ.ID.No.: 9,
SEQ.ID.No.: 10, SEQ.ID.No.: 11, SEQ.ID.No.: 12, SEQ.ID.No.: 13, SEQ.ID.No.: 14,
SEQ.ID.No.: 15, SEQ.ID.No.: 16, SEQ.ID.No.: 17, SEQ.ID.No.: 18, SEQ.ID.No.: 19,
SEQ.ID.No.: 20, SEQ.ID.No.: 21, SEQ.ID.No.: 22, SEQ.ID.No.: 23, SEQ.ID.No.: 24,
SEQ.ID.No.: 25, SEQ.ID.No.: 26, SEQ.ID.No.: 27, SEQ.ID.No.: 28, SEQ.ID.No.: 29,
SEQ.ID.No.: 30, SEQ.ID.No.: 31, SEQ.ID.No.: 32, SEQ.ID.No.: 33, SEQ.ID.No.: 34,
SEQ.ID.No.: 35, SEQ.ID.No.: 36, SEQ.ID.No.: 37, SEQ.ID.No.: 38, SEQ.ID.No.: 39,
SEQ.ID.No.: 40, SEQ.ID.No.: 41, SEQ.ID.No.: 42, SEQ.ID.No.: 43,
SEQ.ID.No.: 45, SEQ.ID.No.: 47, SEQ.ID.No.: 49, SEQ.ID.No.: 51, SEQ.ID.No.: 53,
SEQ.ID.No.: 55, SEQ.ID.No.: 57, SEQ.ID.No.: 59, SEQ.ID.No.: 61, SEQ.ID.No.: 63,
SEQ.ID.No.: 65, SEQ.ID.No.: 67, SEQ.ID.No.: 72 or a functional portion thereof.

45. A primitive hematopoietic stem cell specifically expressing a nucleic acid identified 
by the method of claim 32. 

46. A method for treating a condition in a subject comprising administering to the subject 
a therapeutically effective amount of the composition of claim 43.

47. The method of claim 46 wherein the condition is an immune system condition. 

48. The method of claim 46 wherein the condition is leukemia. 

49. A method of introducing an exogenous nucleic acid into a hematopoietic stem cell 
comprising contacting the stem cell with the composition of claim 42. 

50. A method of ex vivo expansion of hematopoietic stem cells comprising contacting the 
cell with the composition of claim 42.

FIGURE 7
[Amino acid sequence alignment of AA4 and C1qR]

FIGURE 18

[Nucleotide sequence]
SEQ ID. No.: 1
FIGURE 24



WO 00/11168 



[Nucleotide sequence]
SEQ ID. No.: 2
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 3
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 4
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 5
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 6
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 7
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 8
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 9
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 10
FIGURE 24

[Nucleotide sequence]


SEQ ID. No.; 
FIGURE 24 

[Nucleotide sequence]
SEQ ID. No.: 12
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 13
FIGURE 24

[Nucleotide sequence]
SEQ ID. No.: 14
FIGURE 24

[Nucleotide sequence]

[Nucleotide sequence]
SEQ ID. No.: 16
FIGURE 24

PCT/US99/19052

[Nucleotide sequence]

[Nucleotide sequence]
SEQ ID. No.: 18
FIGURE 24


[Nucleotide sequence]

[Nucleotide sequence]

1701 AATOTPCCTC1 CCTCAOOCCT 

17 sl cocrrrrccr caocattoox 

iaci cotztxcao;: qaoaaatooq 

1831 * XASOXXOa UVVXIWJUJA 

1*01 ACCTCACAMS A9CTTAAAJUC 

Uit maascicju: juxcxoocrr 

2001 A03UCIC*Xn TtCCHCOOLT 

2051 CTCCCAMlt: CAXTXTOTO 

2101 COQCACJJUUJ CXOOQAtOOC 

2151 ATAdcnar ttcxccaooo 

2201 CTCAOCCCAtf TOCATOCOTQ 

22si rrccnaOT: AOAtCAAcxc 

23C1 AAOTAOMat OOTX7TOCCC 

2351 W Tl U UacS TOCCCATTTC 

2401 ATACAXCXXA TATQTOACAA 

24U AATAAOCAT3 ATCCICATTT 

2301 TCATJKTP3TC ACCA1TCCTA 

25T-; AACCXffCOGT TACWrTTCA 

26.J1 'XATTOJTAA AOCGA£ACA£3 

25i4 AAC/.GAATXO TCTCCTACIO 

2701 CCCCAArOCO GCTAOCtCTA 

27ui crar.XArc*a TrrofccnuA 

2HIH 3CA^CT=*GT TCCTCACCCC 

28KL OTTTICXCC OTCACTWA 

2901 AOOtHASy.C TOTCCACCA 

29.51 AAA'tWTOl.T CCWXOCTO3 

3001 CTOVTCTCAT TCCTCTCCTA 

JC51 A»:CT3ACO CTCCCTCAGA 

3121 TCATT&<TOC CCACAOTAOC 



AaocrccAOA oaocctatxo acactxittt 
O3TCWCO0A xTitaoajyoA occirorrra 

C5TO000CCC ACACXOCCTO OTCCTOCOQA 
A3CCAC0CCC UJIUUIWKJ CACA0ICTCA 
ArxaXTCAOA OCATCICAOa GMTTOftTCT 
TCGA0CC0XO TTCIC&OTCC AGACGGQCtC 
ATRCTXrtC CCCAAAWIU TO0CACTCTO 
ICTAACGAAA ATPtCTATTT CTAfcAATEAO 
TOWAATTrX lU ' IVJllUk TOTTPCfcAlfl 
ICIOCCATAT TTTAGTITCA AACAOTACCC 
JjCAOTSAAOQ CCCCCCQC O C tCt»CTTT9C 
TQCTCCCA9Q AAQCATtCA? TCTOAcTriU 

c f uueriux uuu/ r uccuj toxtouxtc 

WCAACAGAT CICCATOWA CACAOCT0CC 
ATAGTOCXTT CTCOATACIC CITO0OGACC 
OUJCTSACAT TIUiTlTll T GTCATCTCTO 
CATOOTACOQ ACOCCOTPCA OTOTCOCTIT 
CACATCCTOT QCTQtAXGAC TSCCATTTOC 
AAOCTCCOTO CTTTACTTXC AAAACACTAA 
CAflTCATTTT AATTTTAGAa TCAUTXACTO 
TACTOACTCA CATCCCXCAA G2ACTTTTOA 

'rcccATOOOT m \ kxc i u . t: rorcTOrxou 

CCAOOCAOCC ATAfflTCAOC CTGATCAWC 
OTATATACCC AOAATQCAAT ACA' l ' XVWC O 
TOATCCTCAT ACACQCCACC CTTtCOATCW 
'2CTCACTCUa TCCT tt COVXT CTTGJU3TTM 
'ITOAOGAACA WlVlUTltx: ACTG»OACTT 
AAOACOCTOO CTOCTOACAC TCCATC9T&3 
3TTCTtTCAT TTAAOAACC? TTCATATOGA 
ICOTCCCTM CGAAOAAOQC CTCGCTXAOS 
CTTCTOGAST CCAOCAAOTT CCHJCOTTTO 
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31S- JW3ACMAATC (TtTOCATTT ATTRCntC* S CWaW W O ACCCCXTOCA 
CCCTCICATI* rtOCCTTCCA <7fCI*COTfcO ATOMMhtO XMQOCMAn 

3«x amMia. uicuwi ' iu waroenc itTnxrtcra TttscAixcnc 

3301 CQMCCC3T3* TT CIIUJIW CtOTQCAXAC AQTXTOTTtA TMOTOXACT 
335 L TOTIAAAAXJ* TtAAXtCATC ATDUCCCTA. AAAAAJUUAA AAAJUUUUUA 
3401 AJUUUMA 
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i TAcco9rcca ojuncccoa ctcgacccac ocort cc aooQ ctocoacAag 

si acoacauaaji qsaqqcwjq ocAcccAnru auxcfccroc a«Qauna»a 

ici aoccttjcaj tcaoaaooga /ocacaaaca acxkgkx c A graaoGAOJUL 

1! 1 AACGAOOUA ACMAttACAA CC*Alt3QA£A TCCCiaAflQA OTOCTOOCCA 

3C1 aCCXrCTTUUa TOKXCTAAA AflACAICACT GAAATAA70G ACAfiACATCT 

251 OOTOUJltl OAACOOATCC TOrCTCACXO 1CAAAAOCIC TCAfiATOOAO 

301 ACCIWf!AA3 ATQOOCATCT f J Q A OU X * fOQ TOTTCCAOOO AATrtATAAO 

351 ACCAAOIAC? OUU3AAOCCT caTtCACAJUO AOttAAOAOC T&CTCM7XQT 

401 CCCTXA/1CA3 TXCTCM77XO CTO0CCC3WTT AGNI999CA aAOAEAAAAA 

451 CAAAMUTT TTCA 1VJ1 l ' l ' l ' CAAOMCAAO CC AU»mA C ACAGACTATA 

501 CAOM»7IOQ 9XTTCA£TTC AACCCCCITA <7TEAAAOGA3 AAOGTtQOOQ 

551 ACmatCXA CAAOCIOOTA TOQQtfCACAA CAAACAOACA QAATCTQAA5 

601 ATAACCICCA 9XCCCA3TCA AAOCAAACTT ATTnTOCTC AGCCAOOTTC 

651 AGCLACAXCC CATtOOCCAC trflOCCATTTT CACAZCAA1Q AlCTTGAACT 

"HI CTCCCAOOCT UC T t T lUX OQ AACXXAAAM3 TATTOAAflAA ATCCTOOAOC 

751 ACACTACACA CCACCOACAT (^ACXACCCT TACTOA0OCA CAOSGCTAAA 

801 AACVTTITCC ACAOtTTITOa tnCTCATOCt AACCAA0QCC CTOtGCACCT 

B!il OOCXXXHATC TACTT3CTQOA AAOCCATTIC AOAADOTtTC AAAACIUAOC 

901 ACT^OOrTGA TOTAAAGCAO CAAOCAQAAC AOTCXTIGAA TATTTACATT 

9fil ATOClOCAffXT ATACTOSCTT 'COQA0TTAAA OTtTXTtOCOA (7IOTAAATAT 

1001 AAChAAlTCA AAATCAAAAA i^OCATTTTA CAGTAAAACT CATCTAAACT 

1C!j1 COCAAArOA OC7TACAACTA T CTCT A QCCA ACATA07IOa ACCAOCAGAA 

1HH 3GA<^T»A TTOCCCAOXO i^CACCTQOC CTOTACCTA QCAATCAAAC 

1 1.-3 1 CTQOTCR7IT A17SATA0GA .AACTCCAOTT 09TACCTATT TCCGACATTA 

120 1 iccrTrroo tcacaoaact »saatttaaga atoctcttca actooctaac 

1251 TOCtrPCAUa ACGACIXCAC rOCJCTGACT GAACDUJCIO CCCAGATTCA 

13'H AOAiU3Q3CyA GAATTTCTC2A C TQ C T A CAAA ACAACCTAAG CTTtTCCTAA 

1XU AGAATCIC/A ACQCTOOCAO -TXTTCCCATC CTOAA6AACA CCITAOQAAO 

1401 77A«TA3A3T TCATOC/^AC ATTCAGTCAA AAAATAAAAA CTTATOACAT 

14 31 TXtMATTAiC ACATOCCTCA tAOATTOOOA ICtOCAAAAT TTTCTAATAA 

12 31 ACA'tTCTCJA C TT l 'I OC AAA AATTCACCCA CTTATAAAAC tCACTTTArr 

SEQ ID NO: 21
FIGURE 24 



WO 00/11168 



PCT/US99/19052 



Session Name: crick.princeton.edu



1551 AAATcrcaar roracAocer iciacaacct catctctaca amstoacaaa 

lcoi cxrrcciowa ocacaaicca tcatacaoto qatcaatgao tcaoaotcac 

1631 MOAAOMCr AOICAAAATT JLCCTCATCTT CTQWWTCAT TAACJUlCTTA 

1701 AAGAAAACCC AOJUTACCT AGTTXGAAAA CTOAO C CCC C 

1731 AOAAAOAOT3 OAACAAOGAA / fiA Q AA T OQC TACATXTOAA C7PCACCACAG 

laci crcrcAner: crrerrcAAO TAOcteAana aaagacaoca qcgmmcaiq 

lssi caoctocto; tactctbcax locrAcroorr gtaooctaic aottoctaaa 

13 ci cAorxmrr caocatctxc tookiokul toaottaaac ttcctcttoo 

1971 ATCXAATO3A AAA331AC3AA CATXAATOCC AASAACXQU AAACTTXTOC 

2QC1 WI tem a CCCAflOCATT CITOJIUC Be ACAOCCCtlA OAOCCACAGT 

30! 1 TOAAATCACA GATGrPFICTA CACAAOAOAA AOOACAACOT TTOACATTAA 

a 101 TAG>AC\ACA TATOQOQTCA ClWiurdU AAOAAOTT3C ACATQTCCtC 

21M ACMAACAT3 OAOAACATCA TCACTOOSAA A00CTO0AOA ATOATTTGAG 

2201 ATIXeTCATT OAOQQCMACr A2AAA0CCAC CACCCA1TDC TTACAAATOO 

2251 ATCAAfiTAAA AAAACAATK3 CAAAOTTOlT OCCA1GAAAA GAAACAGACT 

2301 TA17AMAAC AA00XAATCA AAACAOAACA AAAOAAAIOA TACAAAATOO 

2 3 HI ACA1T7XTQ OACTTXCTCC AAUtfTMGO CCTAflACAAT TACTATCCAA 

2401 AAAAAA.WA AAAAAAAAAA AAAAAAAA 



SEQ ID NO: 21 (cont'd)
FIGURE 24

1 ACXaOACtCl CQTAOAOAAA AAAOIOCCPC QtUCTCGCTS CXOAAGAACC 

3 - T=Mwcrca. ccaoaototo awcTrrrcA kaaaaacxca crni»UACO 

101 TCA0UAAACA OCICATTOCT tartCTAACr TATITOIOAT QCAAOKnAA 

1*1 AIMMQmr AtACAOCTCT XUAMOXQQ ASOITCCTXC A flCKUTOC C 

aoi otextooaaii' oaooicma aaaacaoctt ttcacaaaaa cagatcicxo 

25.1 GTITtCCAAC; TQ O AJUU U M ATXTtUAAAO Otfffl UJ ' tfrrC CNTGAAACTO 

301 AACAOQOAA/, ACGAGNTOOC 03COOTIWC CQACA3TTAA OOCTACAflTCA 

331 CVTEVrt*2T OATCHJOCTT CIOCAAOOAX CA7XBJU3GAC Q A XTCTCTOa 

401 TACCTTCAflA A TOJCmULU QCMTODITX AACMCAtWq qCICSCniTO 

451 C TAO3 0C CT 0 fc v:u fl>aua xauasoaca cctcaaoua tcaaxaaao* 

SOL AOAACITaWI OQAAACAflCA TU09IOXQQ TCGAAAflCTT OCGAAAOATO 

53L OTOfcTracm cwooocrao acaooctica Arncoecrr laxccrocfrr 

«01 CTQACTTAC/k CCAATCOATA CATOWTtTC CAAOQCCAXA CNCTOAACCA 

651 OCCAOTtTAt? 1Q0ATCIOIC AOCmCACC TCQAAOGAOC ATaOCATTTA 

701 AATOCOCWEJ OCTPCTXTOA CAOTAOTOOQ A 



SEQ ID NO: 22
FIGURE 24

I Gwarmu* octac*»u= vrnxcxm atcotjcttc tscaaooaxc 

10- AOUXJttXOG CTOOCBOTC TAMMCTGA ACAWWWT OMOTOOOGC 

151 CTO^CLAAAT CJJ^UUCAA aOCTItlAOO OAAACAOCAT CXOOTWOCT? 

301 CONUOCTO CCAAMSXTtt TOWaCTOC TOGCOCrOOA CACOCXTCAA 

251 TITOJOCTW cyiCCtCCTIO tOACmCAC CXATCOAW AlCATTTTCC 

301 AACOCCATAC tCIOAACCJU! O^MXJTtAtT OOATCTOTCA OCITACACCr 

381 COAAtfOANCfc TASCKXTO* AIOCCCITOa CTTCTTrOW AflrTAOTOOOA 



SEQ ID NO: 23
FIGURE 24



- TCAOOOCIAC OCAATtAAM TilMTrtKOCT CATTCXma OCACCCCAflO 
3- CrmCXHT TATOCTTCCO Q7ICCTA70T TCTCTOGAAT TTOttOOOOA 
1Q.L TAACAATTTC ACACAOQAAA CWTATOAC CATQATEACO CCAACCTCTA 

15 1 atacijacjca ctAT^axaa jwctootaco cctccmotx caxrnxooA 

10.1 ATtCCCCOCT COACCCACOC OXCCOACOOC TCXSAOAAAA CCACAOAAfltt 

351 OOQCUXACA GAOGACCCCT T3AACCTAAO GAAACAOCCA QOCTtCCAlX 

30 L OOACCAIOCl* O CASA qy T OJ CAflAAAAOOO AflATCMCAA CFTXCAAXKC 

351 CTCATOTACC TCAACACACT OXCOOAAOO ACCTJCAATO ACTACAT3CA 

40t vuwjliuiu Tnccdrxc iccroacTQA ciTauxTca oraranra 

451 ACTP^OWk TCCOUWSKW tTCCOOOAIC 

SCI GACAXAMO AAAOOAAOTT CAAOTTTACC CACAOfflTA AAGATOTWA 

351 AAAOATIWt. OQAdACATOA COaTOCAOTO CCACTACTAC ACCCACTATT 

601 CCTCA0CCA'.: CATTOTCOCT TCCTACTTCXJ TCCOAATOCC ACCATICACC 

651 CAOOCCTIW OCTCCTtK* GOOCQOAAOC TrTOATOTOC CTQATAOAAT 

701 OrPCCACAGT OTAAAOAOCA COTOOQAflTC TOCCTCCAAA GACAACA1QA 

751 GCQAlGraw; QOAOCTOACA CCrtUATPCT TCTACCTOCC CGACTXTTTA 

801 ACCAACICTA ATOCAOTQGA flflTOCCIOC ATOCAOQATO GAACGACACT 

951 aaocuAiOTJ aocrrccrc cctoccctqa cooooatcco aogaaattca 

901 TCAOCrHOCV CAOACAOOCT C*X«AAAOT3 ACITO7TCAO CAOCAAOCTC 
951 CACCACTMA TAQACCTAAT ITTTOOOFTAT AA0GAOCAOQ O0CCOOCTOC 
1001 TOTAOAOOCA CmAAO^TT 1CCACCCCTA CTTCTACOOT OATAOAATAG 

losi Accroaxyu; catcactgac cooctcatca agaocaccat ccrooocnc 

11C1 ATOOOiACr TKMACA0OT OCCCAAOCAQ ATCTTCACTA AACXCCACCC 

1151 A1CCAGKA*: ACCACAOOOA AAAACCCAQ0 (XCWC AAJUa OATOCTCCCA 

UC1 CCCCTOTA03 CCTOCCA30C CACTCACAOT CCTTCCICCA CAGCCTOCCA 

1251 (JCACTGAGA: CCTCTCAQOT CACACTCAAA OATATOTfcCC TmCTdCT 

1301 AOOCTOiOAA TCCCCCAAAO OOOCCATCGG CCACATOTC CCTACTOAGA 

13S1 AOTCAA'fOCT OOCAOTOOAS AAOAACAA0C TOCTOATOCC CCCICTCTW 

140^ AACAOaUXT TCAOCTOOQQ CTTOATQAC TTCACITOCT OCCTOGOGAfl 

I41il CXAEOGrTCT GACAAOATCC TOATOACCIT TOAGAACCT3 OCTOCCTOOO 

1301 «7PCCCr3TCT OTOCQCTOtX ' KJ L U-1 ' H.u: CCACCATOAT coicacatcc 



SEQ ID NO: 24



lss:. oooqccaxg CA mwivm cvtctcggw ctoaocctoo tcaaaoctcg 

L60a CCCQAOJOOrr STGAAACTCC atCAOGCCTT OTtfOOACAC ACTCA0OCOG 

L65.1 TCACATOICT aACAOCCTCT aiCACCrtCA OCCTCCTOBT OUtCOCATCC 

L701 CAOOATC3CJ. ITlWAfrX T CWOCACCW OACCACCXCT c x otfitriaoc 

L73L CTOO WiCl T TOCACOOOa AAOOCAICFC AflCCATTOCC AtCACZQATO 

180 L TCXCOOCAAC CAITOTCtCC T3TOCCOOHJ CCCXCTTOIC CCTCTOOAAT 

L85L CTICAATCQM: AOOCCCTOGC CAfl tA Tl A CC ACAGCCTOOO GCCCAOAJMX3 

L90 L AACCVTAACCI TOCTOCTOCA CCCAOCOTW OATOCAAGCC 

L95t ACWiATC*? aU-UAXLAOr AAUUWLTJGAA TCCTXCOOSkT TTOOAMWCfc 

taOL CAOOACCTUi AGMOCCOT TCCCAOOCAO OCAOIGATOG AOOAOCCCTC 

1051 CAC93ACCa; CTMOCCCCA OA0QTCACXA OTOOOCCAAa AATCTTOOOC 

[191 ixaorcciai oeroaAiaxc aotottoctc toaoiqocaa cxxcaocaag 

OCAATTCCTU CTGrtTlACAOC 1CT0OCGATC ACTAOQAACC AOAOCAACCT 

1201 CtTOOTX»: QATOACAAGQ CCQAATCTTC TOCT0GTCT3 CTGATOOOTA 

1251 CttAOACMai HXmOAOXl ACCOACCTCA AACCCTOQA0 CCCTOGCOTC 

IJ01 OGCA3CJACA GOCTACAOOC ACAAOATGAT CTOTAOCCXO OOCXOCTtAA 

351 CGACAOC'AAtt VlVIUUCm; CCTCCACTCC ACACAGTTCT CAAOGAOICC 

401 croATtrrrrf gcaccottta ccctaaacat o t ckj i w ic tatomactt 

451 CTOIAAClAAlJ GATCTTGCIA CACACrOATO CTGGAAACIO ACOCTCCAT5 

501 OGAAATlTCr ACOTOGCTCA CTTCACCAAfi GCTTATTOCA CTOGGAAAAG 

531 AAOA0QCXX7r OOQ TA TTOST TCATQOAAAC CACCCCACTO ' IVi ' l ' lATlVl 1 

6G1 ATTAAAACTZ CATmCOOA JAAAAAAAAA aaaaaaaaaa aaaaaaaaa 



SEQ ID NO: 24 (cont'd)
FIGURE 24



Kdd2.d*4  Length: 499  August 13, 1997 11:21  Type: N  Check: 4033 ..

1 CCCATTttTT COWCCTCCCA TOMXUCXOC ACCXOOOQG* OCQATXTXAT 

31 OTOftACJTOJ XTTXCrmXT atZMUkTOOM OW7TGWAAA CRC7TQOQMT 

1C1 CATAAAMAA OQGMAAGG* JJUUACAGK3 GGATCR9Q3 COOTKTOA 

151 CMCAAOOAt AOCCTCCCAC CAAACATCTQ CACTOCACIC CCAOtOOAAC 

3ci aao»ju^tiaa c T Aiuj ' mrr caocccctoa awioaocr taacxtgatt 

2!i rr»cJctor cracccrrxcc taatagaoca qmatotui tattattctt 

3ci ATrrxjuuas ttoutocct owctgcaat gootcmuuc tkcxcctcxt 

3M TiXTlVlVn 1 CGM09CRAT AMMMUf OGACOUUU3A GAUMJUJllA 

401 TCACXTCTA3 TCAAAAOAOC CMT9TCOCA OIKAACXCT TCAXAAOAAA 

ill AlfllJU LU AC AUMUUUM CMUUkACXO MCTCTICtO dTOAAAAA 



SEQ ID NO: 25
FIGURE 24

34-A3-T7.Seq  Length: CM  July 24, 1997 11:39  Type: N  Check: 9984

: co oo cffoa cc coaoAXimx cwuusrrccxT tcttocacac cowaaaata 

V, TQCONCAllAt OATAAOAAOT TTTTCTGAAC CCTTTOOtlAA OWACTPOCT 

iou caomcwi ^roortauaw oomaocica taatcctaoa ggacataatc 
15. AiorcoAftCA rccmaAcr catacagato tcaoctctit ccaoacaato 

201 OACCAAAXtK TOICTXATXT OUAAACTAT ATOCACAAAT TACAAAOAAA 

25i cnowxogL cprtcwpoo atccaaatoo jycwrrcMrr tarrcrrccr 

351 OCCTOWCTT AAACTOTTOO A2CTCC*OGA WWtWWJ AAACCWWA 

401 AOOSWrCKQA (UriCIWCA QtOOOACTAG AAAAAATQOC TAlWlOff 

43 L CATATCCAXU AACCOAOCTC MPJT1UW TTA ATAAAOICOI W»A»CCNAC 

SOL AAACWCTCXXl AOA1CAAAAA A33TCCACCC AOGAAKtTGA TCCCTTATQA 

551 AATOVAAATT QAtQCCCCCT GCCTXTiaAA TCAAOGAA7T OOCCAAATTN 

6CL AAOafrnTKI AAATVtTCAA accccggaac aaccgaatcc u*waaaacc 

6&x ccncouuaa aaaaaatht ttoggccctt aaaaam 



SEQ ID NO: 26
FIGURE 24



34-C7.Seq  Length: 451  July 24, 1997 11:37  Type: N  Check: 6224

1 CAAAA**(AC3f IVlVl T TA QC AAATAAAAAC TOCCCAflTOO GACAANCOTC 

51 qccgqoaa*: ocxotcAocA cioccaAvrr caidulatocc tooctcitoc 

101 GAMXMGAAA OAAASOCMC CCACAflACAO WttAAAAflQC CyWU A Q JC 

isi cwwiocccm: OAAACCOAAO AACCACANGA MCCC1CTCCC CCTICOOCCC 

aci cancan: oaacciuach ancaaatcaa ksqaomoam acccaocaka 

351 HCOJOCAAWr ClfcTWCTC CTOACTCGCT CGAMKAAAN CATCCCOGOC 

3C1 UOOGCAC2AAA AAATCCCTOO OQMCQCACA ACTOCCCCAA GCCQACGAAG 

351 AA LW3WW TCAAITVCT MV1NACAAC 1UH1U1UJI KEAWMCA 

401 MCAOfcMBAT QAACATCCAC JAOTCTQCCT TCTCCTQCAI WXMJaff r Q 

4*1 OOCTATtXMC A092AAAAAC ACO0QJCN00 OAACCOQAAA ANQGAAMAAN 

5dl t2X3*JCOK2A TWanOAWTC ISXCOOAOAC ACaCACTOT CKAOJAtCOO 

5iii 00OOC2EUNC (XUAXC HM tTXTCTTCCNA I TO CC OOOOS GAAOWCCOI 

601 AAATTCWQC CCCOWWOC IXNAO30GGQ OOOOOGSJCCN TXTTCTTTCT 

651 M 



TlNUR* mil 't/rj.rTi c C ta ./^ ii^nf 

SEQ ID NO: 27
FIGURE 24

34-P4.Seq  Length: 683  July 24, 1997 11:37  Type: N  Check: 1900
I GCOTC.TC/rrr iJOACTOCAGA UXJ&PQQQQ TOATCTCAOCC CXTCCTTCCC 

5i Ttiociiyaw cia*aoaior rtcrrccroc acccacmcc aooooqatoc 
io* CACircrrcc tcooocttqa, cu t u rrr cTC tqccoc»cc tcccpcccc a 

15:. TSTCJkTPXG ril/ll ' lUlUU ATQ01CAT7O CACTTTAAQA QCA5AACMQA 

jo:-. ' iViTJiA -rrr tocccoctto aauoctwtt oxacctcctc Tcwronrr 

23- OOACTCCVTC H ' lOflUl ' lC CWWCCTPQ CTTCTTACTO ATAATTXTOC 

301 TMXATrccr AAcrrncAA vacAtm? tttaactact atattgatco 

351 TOCTTDUUA CWXTPXAHC OWA1U1U1' ATTXCTCWX 

4ai ocrmrrr TOroaaccA rrtoroawc TrxTcrcrrr tatmtwta 

451 OOMftAATTC CCATOTEAM OUTCCCACT- OTASCMTtt TAXACAQACA 

sax ATAsurowi Toccrrmsc auaaaaaog crorarrraA aatcatccoq 

551 AWICBOCCAM GAAACTCriCt KAAAOCJAAA OJCCNACCtC TCTOTICCTT 
601 U- Wn MKA : TCKTCMCCO MttttAOOTO Cn^CC C CCQ AATTTTOGTT 
651 TOOrrWIOTT TMCCOAAAAA Tl ' ITIlUJlM TXW 



?AP- kinase 



SEQ ID NO: 28
FIGURE 24



33-A3-T7.Seq  Length: 634  July 24, 1997 11:37  Type: N  Check: 1627

I AATOZACCTC: Cl ' MlAnCC CACnTCCTX NTCA'ITIUJU nCTGXTCTT 

51 GTCAtACCCJt QCCTOACOOC TICXCACTCC 00GMQ9CCT OTmCttK*. 

ict crn ' ivriTJ TiuyaaAMiza cwnviTrr ttgamuuht tcaaaaca x t 

isi Tiwjgjoai tracrrrrr tcuccacgak ccwaftwrrr tctaxuaat 

301 C03tWlTQT.f WWWl ' l * C tflTl ' l ' lWl ' ttMlTUTa C7ITOCC7XTOC 

331 CTTTmTTP TAXOTOOCTTT 0OCCCCXTGX AASOQGTtGCC CCACICACTr 

301 C I CKU ttBiH: GAAOGACXQ TCWWCCOCT CTTTOTCNOA MCTQAXCAA 

i5t ucrcroacrr Trrraaucr caroraucs TrrcMjmro mroioca 

40i jysAccccoa: octctoocwj caxctoccct qsaaccccam ccecwaiic 

4Si cxtcttrw: ' lwicc o ccc cjucmttaaju COTicwacw tccciawum 

301 GTTCTTOAM AUtfRAATT MATPOTTCCC MfPWPTMTO OOAAAAAATT 

551 AKCUC^CT: OWCCAATTT IVrXCTDIQf JWXOQOCXO* AAflAAAfflCCT 

6ci wraccua M* amtx ' rnu aocw 




SEQ ID NO: 29
FIGURE 24

38-03.Seq  Length: 641  July 24, 1997 11:37  Type: N  Check: 636
I A3CCCACACC AOQCAACAAO GTCCTTTOM CTCCTATAATG AAMAtCXOC 
51 ATXCXQAT3A GAATQAOQAA QAATCAGATCJ AAffXOQJUMA TA*CGACAA,f 

io a oacxctqaaa tooaqaoqcc totjaataoa gcakscaocc oAAcrcocAa 
i5i AJjraocrrx ujroxsaacx ocatiwnok aaocwtxct ecricrrcAC 

201 CCCDCATCA CQAACCTCCA CCACCCTIAC HAAAAACCAA CAACAACCAO 
251 ATTCTXaUW TQAAAAOTCC EOltAAAOCAA AOCAAAflCAfl ATAAOCAAAT 
201 AAAOMFXTl OAATOTOACA AfXJCATACCT AOATGAACXQ CTTW1AOCTTC 

351 catmunrr rantacxTW aoaoaaagac acattctoca qcmaicotj 

401 AACCTTAMG AAOAAACZTU OACACPITCA TASCACCAAC ACAACATITO 

451 A A ' mU AlCT r T UflUQCBO aVCAAAACCA CAOfPCCOOTA ACTACAOWT 

501 TCCXOOAAAC: ATCTOGAfcCA T2CXOAAQAT ATAACCACTG OATOCCTQIA 

551 CAACCAATOT (mTTUMW ttAMTOC7IT MmWTO OCTCTCAATT 

601 nrctTCCCT TtaXTTAAAT O AAAA CC CCC attATOATOC N 

LT$?/*llt3 

SEQ ID NO: 30
FIGURE 24

1 OCOAOCOaAC OaAOMMCTC TACQCTO O XT AOCCWTCC OOAACCTWtC 

31 CCTCOAATIA OCCCACnCAO OCAOAGAOOO OQOOOOGNTr QC IHXtAXX 

101 'I iwrWOAO OOOCTOCASrr OCAUOOOAO* CCCAOCOOCA OATTCTCXtCA 

151 CftCACOAOOQ AGAAOOCOTO AOOAGACAAA OCCOTCACAT CCOCOACAOC 

201 TTCCTTCAOC AOCCTCCTCC TCTCCAGTCC KMCCOACC CCCQAOCCCC 

331 TOMOCKTCC CTPCOOrTCTT COGAACACCC TMTCMTCA TTQCTAKCAQ 

301 QOOATCA-mX OOOftOCCTOT AAOTAOCCAO TOCAOCTCCT TOl T mCJO 

351 CttMGATOCCC ATCCCACAAC TWATCTOTC tJOAGCrOOAA OCCCTOOOCC 



SEQ ID NO: 31
FIGURE 24



X ACQQCT9COA OAAOACQACA GAAO0GAAOG TAAXQCTQQ TCOCTOCTIC 

51 T0AAC0CCOQ OTttJOACOfcA CCOCTtTOC U CCCTT3CCQQ ACOTOAOCO0 

101 AAAOAAOATO OCOQrPOCAOO TOCnOCAAOC TGTOCAN0CO 0TTCA.TCTTO 

151 ANTCIOACOC TTTOCTAHTT TOTCTCAACC ATOCXCTQAO CACAOAAAAN 

201 OAMSAAOTOA TO Ott llH/ll i TATAOOOOAN TTOAATCATQ ACATAMOA0 

351 TQACPCCNAA TTTA£ATACA CTQCAACOM AATOCOCACA OTCCNAOAAA 

301 AflATCOATAC CATCAOAATT GTTCATATCC ATTCTOTCAT CATCTXOCOO 

351 COTTCTOACA AOANAAANOA COCTOIWAA ATTTCTCCAN ANCANCTOTC 



SEQ ID NO: 32
FIGURE 24

1 GATOOCOOTC TCCACAOQAO TTAAAOTXCC TCWWTTTP OULWlll ' Mtf 

51 AAOAACTTCA AOAA0GACAA AAAOOAOTAO OTOATOCTTAC lOTlA B C TOQ 

101 OOOCTTOAAa ATOATOAAOA GA7QAGACTT ACAAOTXOOA CAOOCATOAT 

191 TATTOOOCCA OCAAOOACAA ACTATOAAAA CAGAATATAT AOCCTOAAAO 

201 TAGAATOTOQ ATCTAAATAC CCAOAAGCTC CTCCATCAOT TAQATTIOTA 

351 ACAAAAATTA ATATOAA1X5C OATCAATAAT TCCA0ZOQAA TOOTOCATOC 

301 ACOOACCATA CCAQTATTAO CAAAATOOCA AAATTCCTAT AOCATTAAAQ 

351 TCATACTTCA AOANCTAAOA CTCTTATQAT OTCCAAAOAA AATATOAANC 

SEQ ID NO: 33
FIGURE 24

1 ACOOCTOOOA OAAOACOACA GAAOOOOOAO C f l CTO C Wt! CCAOOOACTC 

3X CAOTACACCA CCAXCOCOOA TTCTOAOCOT CTCICO UCC C r Tlfl U ' M iri T) 

101 OTtAOCCTOC ACCAOCrrCT COCOGACCAA AAAOOOMTT CiLlWATWI 

151 CTOAMATTAT ACtOTOCCTO OTCATCTTOA TITOCTICAO TOCATCTACA 

301 ACATCOOCCT ACTCCTCCCT CTOQCTQATT GAOATWICT Q TOCIOCTOX 1 

351 CTTACTOTO TPCTACACOT OTOACCTQCA CTCCAAGATA TCAPTCATCA 

301 ACTOOCCTTO OACTCACTTC TTCAGATCCC TCATAACAAC CATCCTOTAC 

351 CTOATCACCT CCATTOITOT CCTTOTAQAA AOAAOAAOCA OOICCAOAOT 



SEQ ID NO: 34
FIGURE 24

1 TACDCCTOCG AOAAOACQAC AOAAOOOOCT CCOOCTTCTT CCCCACCTCC
St OOCTOCCOOC (MCACC07PC TTPOCAOTCC QQOACCGOGT OOQATCOTTA 
«1 CCOCOOTOCA COAtOCCCCC OAAAAACUOA OOAOATOOAA TTAAACCOCC 
" X f°GAAIBWT OOAAOAITTO OAACCTCACT OAAMTIOQT ATCOTTOQAT 
30! TOOCAAATOT TOOOAAATCT ACCTTCTtCA ATOTAITAAC CAATAOTCAQ 
251 OCT1CMCM AAAACTICCC ATTCTOCACT AXIOATOR'A ATOACVOCAC 
301 AGTOCCTOTO CCAdATOAOA OOTXCOACTT TCTTTOCCAO TACCATAAAC 
3S1 CAtCAAQCAA OATCCTOCTT TCCTAAATTJT AATGQATATF OCTOQCCTW 



SEQ ID NO: 35
FIGURE 24



1 ACOJUv^ OAAflAOQACW OAAOOOCTCC U U ' UX ' m: OCOOCOTTTC 

si Aji^ Wi ,. 14 uonoooet orcotcNwr cocnocoMA ccoococacc 

101 OCAOCCATOA CAOAMKTOA TOWUTCCQ AMOXTAW CCCKHCAKA 

151 ACCAAOAMC TCCTOOACCT WTCMCKO TCTTWTTNCT 

a« ACAANCAOCT KN0AAAO0A CCKMXJMCC WCAAAACCC IC/UCAOAAQ 

"1 CATCTCTOWV TTCATTOTO* TCQCCANAN CTOAACCTTO OAGATCTCCT 

301 OCACCTCCCT C1HTTOTCOA AAAAMNAAT OTCCCTACCT IRTTG>ndOi 

3!l CCA * C A03CT TTAMdAAA^X} CCIVKXWTT TtXADCCKTC ATCCCItTTCT 



SEQ ID NO: 36
FIGURE 24

I OAAAOQCOOC TAOATACCOC OATACTtiOCO OOlA'AVVIX Xi AOCTOQfcCTC 

si Tccrorrcrr occacattto actoaaoaao oacaacooco tcaaootito 

101 AOOACTOACA OATXX7TANKC CCAdCCTTCC •KNOCCCCCA NAAA0CGAOC 

131 ANACCICACA TOGAACCAAA OOCCCCATOC CCAMCCOCCO 1CCCCTCACA 

201 OOAMAWAAA TIC O/ R OTC T U/ ft W W CACItXXJLOC CtOOCCOCTC 

251 TOAAOCIXXC TCTCCIWTA TCTAJWCTtJT TOCACQTOX TCNOAOTtOO 

301 aaocqcoott ctoacccoot tctccacatt oacctocoqa tttioooctoa 

351 OCTCATOCTA WU xniXXX ! TCNATOCtWA CACICIWOO AAOT 



SEQ ID NO:
FIGURE 24
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WO 00/1 1168 , ^ / „ PCT/US99/19052 



S> 


i acooctocoa OMCAaaxcai oaanoooqoc aooatoqctc oarotwACAo 

SI TTCAAOA.TCA ANAN9CATAC CCCACTOAOC AAOCTtQATOA AOCCTACflQT 

101 CAOAOOCAOC OCTTOTOttT NANOCAOATT CQAft UW t TTOAIOCMCA 

131 AOCAATCAAT OMACAOACA CTCCACCCCA NCTOOAOATO OAaWIGAOQ 

301 ACACCMPTOA TOTATTCCAC CAOCACACAG OAOGATCAMC OTOCCOAOQO 

231 AOCOTCCCAC ACCCAACCOT TOTCCTQACC TOTOCTATTC AOCAOTOACC 

301 ATOCTOCCAC ACCCA 



SEQ ID NO: 38
FIGURE 24

1 AOOOCTOOOA OAAGAOCUCA OAAGOOOAOA GCATCATOOC OCTAAOCOOT 
31 OGACTOOCAT TOSCOOCOCT CAOACTCT30 QQTCCGQOAO QATQTATCTQ 

ioi ahctocitoo qaccakahcc TOCAarxexw tooto/tcat Oknujgoc i uj 

151 QOCATCANCA CACCCAOTOO CATCOCOCUC TICAOATCCC CAOOOQMOQ 

201 OCCTCTACAO CAACCTTCAG CAOTAIUACA TCCOTTACCC TOAAOCCAIC 

331 riTOAACTTO OCTTTTTCTP TCACAACCCC AAflCCCTTXT TCATOTIOOC 

301 CAAOOACTOT WXCTOW0CA CTACAOCCCA AWTCACTCA CTACTTCCTO 

351 AAOCTCCTCC ACGACANOGA CCOCTTCTGC OCTCWTACA CAHAAATCOA 

SEQ ID NO: 39
FIGURE 24



i ooccaicaan oantxwrm otcatooccq actcctcaaa oatoqacccc 

Si TCTACCOCCT CAACACTAAG CCAOCCAOTIO CTOAAOOQAA A0AOOOCXOT 

101 OAOTCOCTC? OOTOTTTACC TCCCOTCTCA CTOCITCCAC ACOANAAAO* 

A31 CAAOCCOOTT OCTOAACCAA TCCCCA1CTC lAGCTPCTOT CPTOQTACAA 

301 AAOAACAGAA CCOOGAOAAO CAOCCCOACO AACTCXTTtTC CTOCOCOOAC 

251 TOCOOCAATA NCOQTCAfCC ATCOTCTTPA AACTTCTCCC CANAOCTOAC 

301 ACmSAOAQTO AAOCCTTACN OTOOCPTNCA TTCWrtOTAA AACATCACT 



SEQ ID NO: 40
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WO 00/11168 



PCT/US99/19052 



>SC| LLS-96-fuXl.seqJ 14350 
AAAfiCTCTTSCTAMGCCTOW^^ 

accctcttttaaaaaSSS^^ 



SEQ ID NO: 41
FIGURE 24

>SC|LL5-96-full.seq| 14350 Frame 1

MFQAAGAAQATPSHEAKGSSGSSTVQRSKSFSLRAQVKETCAAC

QKTVYPMERLVADKLIFHNSCFCCKHCHTKLSLGSYAAMHGEFYCRPHFQQLFKSVGwvn
^GRKQHKELWAHKEVDSGT^ 



SEQ ID NO: 42
FIGURE 24

>SC|SA7-full

AGCTCTAGTC CCCAGAGATG TCGCCACTAC TGCTGCTGCT GCTGTGCCTG 
CTGCTGGGGA ATTTGGAGCC TGAGGAGGCC AAACTGATCC GTGTCCCTCT 
TCAACGAATC CACCTTGGAC ACAGAATCTT AAACCCACTG AATGGATGGG 
AACAGCTGGC AGAGCTTTCT AGGACCTCCA CCTCTGGTGG CAACCCCTCC 
TTTGTGCCTC TCTCCAAGTT CATGAACACC CAGTATTTTG GAACTATTGG 
TTTGGGAACG CCTCCTCAGA ATTTCACCGT TGTCTTTGAC ACGGGTTCTT 
CCAACTTGTG GGTTCCGTCC ACGAGATGTC ATTTCTTCAG TTTGGCATGC 
TGGTTTCACC ATCGCTTTAA TCCCAAGGCC TCCAGCTCCT TCAGGCCCAA
TGGGACCAAG TTTGCCATTC AGTATGGGAC CGGGCGGCTG AGCGGAATCC 
TGAGCCAGGA CAATCTGACT ATCGGGGGGA TCCACGATGC TTTTGTGACA 
TTTGGAGAGG CTCTGTGGGA GCCCAGCCTG ATCTTTGCTT TAGCCCACTT
TGATGGGATC CTGGGCCTCG GCTTCCCCAC TCTGGCTGTG GGCGGAGTTC
AGCCTCCGCT GGATGCGATG GTGGAGCAAG GGCTGCTGGA GAAACCCGTC 
TTCTCCTTTT ACCTCAACAG GGATTCTGAA GGGTCTGATG GGGGAGAGCT 
GGTCCTAGGG GGCTCAGACC CCGCTCACTA CGTACCTCCC CTCACCTTCA 
TACCAGTCAC CATCCCTGCC TACTGGCAGG TCCACATGGA GAGTGTGAAG 
GTCGGCACAG GGCTGAGCCT CTGTGCCCAG GGCTGCAGTG CCATCCTAGA 
CACAGGCACA TCCCTCATCA CAGGACCTAG TGAGGAGATC CGGGCCTTGA 
ATAAAGCCAT TGGGGGATAT CCCTTCCTGA ATGGGCAGTA CTTCATTCAG 
TGTTCCAAGA CGCCAACGCT TCCCCCTGTC TCCTTCCACC TTGGTGGAGT 
CTGGTTTAAC CTCACAGGCC AGGACTATGT CATCAAGATT CTTCAGAGCG 
ATGTTGGCCT CTGCCTGTTG GGCTTCCAAG CCTTGGATAT CCCCAAGCCT 
GCGGGACCCC TCTGGATCCT TGGGGACGTC TTTTTGGGGC CCTATGTGGC 
TGTCTTTGAC CGTGGGGACA AAAACGTCGG CCCGCGCGTG GGACTGGCGC 
GTGCTCAGTC TCGTTCAACA GACCGGGCAG AAAGAAGGAC TACGCAGGCG 
CAGTTCTTCA AAAGACGCCC TGGTTAGGGT ACAAGCTCAC CGGGCCACAG 
CAGCTATGCT TCTTTCCAAT TAAACAAACT AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA A 



SEQ ID NO: 43
FIGURE 24



>SC|SA7-full Frame 3 

MSPLLLLLLC LLLGNLEPEE AKLIRVPLQR IHLGHRILNP LNGWEQLAEL
SRTSTSGGNP SFVPLSKFMN TQYFGTIGLG TPPQNFTVVF DTGSSNLWVP
STRCHFFSLA CWFHHRFNPK ASSSFRPNGT KFAIQYGTGR LSGILSQDNL 
TIGGIHDAFV TFGEALWEPS LIFALAHFDG ILGLGFPTLA VGGVQPPLDA 
MVEQGLLEKP VFSFYLNRDS EGSDGGELVL GGSDPAHYVP PLTFIPVTIP 
AYWQVHMESV KVGTGLSLCA QGCSAILDTG TSLITGPSEE IRALNKAIGG 
YPFLNGQYFI QCSKTPTLPP VSFHLGGVWF NLTGQDYVIK ILQSDVGLCL
LGFQALDIPK PAGPLWILGD VFLGPYVAVF DRGDKNVGPR VGLARAQSRS
TDRAERRTTQ AQFFKRRPG 



SEQ ID NO: 44
FIGURE 24

>SC|DD136| 14044

cggtgacgcaggaccaggactcgcgcgtccagcggagaagcaggagaagccggcgaccttgcgctctcagcctgatccct 
gtcttggcggcctgaacattcgcagctggagagatggcgttcgtgaagagtggatggttacttcggcagagcaccattct 
gaaacgctggaagaagaattggttcgacctgtggtcagacggtcacctgatctactacgatgatcag^

tagaggataaggtccacatgcccgtggactgcatcaatatccgcacggggcatgagtgccgggacatccagcctccagat

gggaagcccagagactgtctgctgcagatcgtttgccgagacgggaagaccatcagtctctgtgcagagagcacagacga

ttgcctggcatggaagtttacactgcaggattccagaacaaacacagcttacgttggttcagcaatcctgtctgaagaga

ctgcagtggccgcgtccccgcctccctacgcaacctatgctacaccnacccctgaggtctacggctatggtccatacagc

ggcgcataccccgcaggaactca,\gttgtctatgccgccaacgggcaggcatatgcagtgccataccagtacccgtatgc 

aggagtttatggacaacagcctgccaaccaagtcatcatccgcgagcggtaccgagaca^ 

gcatgctcgccggggcagccacctccatggccctgggctctctgttctgggtcttctagagccttcaacattttctgtgc 

atagcttctgttagtcctgtgtgcagtaatttgatttgcagggcatttctgtttgtgacaagtgtctttcataataattt

aaatagttcttttgaaggtggtaatctaataattgtgactgacctgcatggtaccacaaagaaagcccgaggtatgctgt

gagtgagagcctgagtccttccgggtactagcttgcaccaagtctttcttagggacttttggatggctttatgtaaacac 

acccagttaaatgggcaatttccgtccagttaggtgcagtgttgaattaagggatggctttccttgctatgccaatacta

atactgctgatggaggaagatgtgtgcaagtgtggtgaggagagtcacagcttctttaactgtggattctcttctagacc 

cctgctgcgtgttaccctaggagctgtgggctggtggctcctgcaagactatggtgtgaggaccctgtaacgtacctctt

ggagcacttaggtaccccttgaagctcctaggtatcaccagcaggattggctgctcaggatgcagagggccaccccctcc 

ctttaaaaattacgctccagtaatctgcccagttttattttcttgttattcttctgtttgcttttcctggggatgattgg 

cattagtctggagttaggaattgattcgagtgccggtgggtggaggcatgcagggagctgtccagcgacctgctctcagt 

gtttgttttaggtatattgattgccagctcaggctgcagagagcctatagagactatttttctacttgtaaagaaagtat

ggtgaggggaattgaggagagccttgtttgaatgttcctgcctcaggcctcctggggcccacactgcgtggtcctgggga 

ggcttttcctcagcagtggaagggaggccccgtggctgtccagagtctcaggttttgagtgagaaatgggattgggtaga 

GCATCTCAGGGATTTGTTCTAATCCCTCATGTTATGGGGATCCAGCCGTGTTCTCAGTCCAGACCCGCTCACCTCAGAAG 
AGCTTAAAACATTTCTGGTCCCCAAATGTGTGGCACTCTGAGAAGCTCACAATCTGGCTTTCTAACGAAAATTTGTATTT 
CTAAAATTAGAGAATACATGTTCCACGCATTTAAAATTTATGTTCTTTCATGTTTTAAAGCTCCCAAATCCAGCTTTGTG
ACTGGCATATTTTAGTTTCAAACAGTACCCCGGCACAAAGGTGGGATGGCACAGTGAAGGCCCCCCGCCCTCTACTTTGC 
ATAGTCTTGTTTCTCCAGGGTGCTCCCAGGAAGCATTCATTCTGACTTTGCTCAGCCCAGTGCATGCGTGCTGCCTTGCC 
GCCGTGCTGCTGGGTAGCTCTTTCTTGGTCAGATCAAGTCTTCAACAGATCTCCATGTGAGACAGTTGCCAAGTAGATGA 
GGTGGTGCCCATAGTGCTTTCTCGATACTCCTTGGGGACCTGTTGACACCTGCCCATTTCCAGCTGACATTTGTTTTTCT 
GTCATCTCTGATAGATGGGATATGTGACAACATGGTACGGACGCCGTTCAGTGTCGCTTTAATAAGCATGATGCTGATTT 
TACATCCTGTGCTGTATGACTGCCATTTGCTCACAGTGTCACCATTGCTAAAGCTCCGTGCTTTACTTACAAAACACTAA 
AACCAGTGGTTAGTGTTTCACAGTGATTTTAATTTTAGAGTTAGTTACTGGCATTCCTAAAGCCATAGAGTACTGAGTCA
CATCCCTGAAGTACTTTTGAAACAGAATTGTCTCCTACTGTCCCATGGGTGTGCCCTGCCTGTCTCCTGGCCCCAATGGG 
GCTAGCTGTACCAGGCAGCCATAGTTGAGCCTGATCATTCCTGTCACCAGTTTGACTTGATTATATACCCAGAATGGAAT 
ACATTCTTGGGCATCTCAGTTCCTCAGCCCTGATCCTCATAGACGCCACCCTTTCGATGGCTTTTGCGGCGTCACTTGTA 
CCTCAGTGAGTCCTGCGATTCTTGAGTTAGAGGGGACGACTTGTCCAGCATTGAGGAACATGTCTCCTCCACTGAGACTT
AAATGATGATGCAGGGCTGGAAGAGGCTGGCTGCTGACACTGCATCGTGGCTGATGTCATTGCTCTCCTAGTTCTTTGAT
TTAAGAACCTTTCATATGGAAGGCCTGAGGCTCCCTCAGATCGTCCCTTGCCAAGAAGGCCTGGCTTAGGTCATTAGTGC
CCACAGTAGCCTTCTGGAGTGTAGCAAGTTCCTGCGTTTGAGACAGAATGGTTCAGATTTATTTTCTACATCTGTTGTTG
ACCCCATGCACCCTCTCATTTTGCCTTCCAGTCTACGTAGATGAAAGATGAAAGGCAGAGGATGCAGACAGTCTTCTTTG
TGATTGCTTCTGTTATTCTGTTGCATCTACCGAGCCCGTTTTCTCCCTGTCTGTGCATACAGTATGTTTATAAGTGAACT 
T GTT AAAATATTAAAT GAT CACTAAC C C T AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



SEQ ID NO: 45
FIGURE 24

>SC|DD136| 14044 Frame 3
MAFVKSGWLLRQSTILKRWKKNW

FDLWSDGHLIYYDDQTRQSIEDKVHMPVDCINIRTGHECRDIQPPDGKPRDCLLQIVCRD
GKTISLCAESTDDCLAWKFTLQDSRTNTAYVGSAILSEETAVAASPPPYATYATXTPEVY
GYGPYSGAYPAGTQVVYAANGQAYAVPYQYPYAGVYGQQPANQVIIRERYRDNDSDLALG
MLAGAATGMALGSLFWVF*



SEQ ID NO: 46
FIGURE 24

>SC|DD116| 14045

CACCACAGAGGACCCCTTCAACCTAAGGAAACACCCAGGCTTCGATAGGACCATGCTGCAGAGGTGGCAGAAAAGGGAGA
TCAGCAACTTTGAATACCTCATGTACCTCAACACACTGGCCGGAAGGACCTACAATGACTACATGCAGTATCCCGTGTTT
CCCTGGGTCCTCGCTGACTACACCTCAGAGATGTTGAACTTGACGAATCCCAAGACTTTCCGGGATCTTTCTAAGCCAAT 
GGGGGCTCAGACCAAGGAAAGGAAGTTGAAGTTTACCCAGAGGTTTAAAGATGTTGAAAAGATTGAAGGAGACATGACCG
TGCAGTGCCACTACTACACCCACTATTCCTCAGCCATCATTGTCGCTTCCTACTTGGTCCGAATGCCACCATTCACGCAG 
GCCTTCTGCTCCTTACAGGGCGGAAGCTTTGATGTGGCTGATAGAATGTTCCACAGTGTAAAGAGCACGTGGGAGTCTGC
CTCCAAAGAGAACATGAGCGATGTCAGGGAGCTGACACCTGAATTCTTCTACCTGCCCGAGTTTTTAACCAACTGTAATG
CAGTGGAGTTTGGCTGCATGCAGGATGGAACGACACTGGGG 

AAATTCATCAGCTTGCACAGACAGGCTCTGGAAAGTGACTTCGTCAGCAGCAACCTCCACCACT 

TGGGTATAAGCAGCAGGGGCCGGCTGCTGTAGAGGCAGTGAACACTTTCCACCCCTACTTCTACGGTGATAGAATAGACC
TGGGCAGCATCACTGACCCGCTGATCAAGAGCACCATCCTGGGCTTCATCAGCAACTTTGGACAGGTGCCCAAGCAGATC 
TTCACTAAACCCCACCCATCCAGAAACACCACAGGGAAAAACCCAGGGCCT

CCCAGGCCACTCACAGTCCTTCCTCCACAGCCTGCCAGCACTGAGACCCTCTCAGGTCACAGTCAAAGATATGTACCTTT
TCTCTCTAGGGTCGGAATCCCCCAAAGGGGCCATCGGCCACATCGTCCCTACTGAGAAGTCAATCCTGGCAGTGGAGAAG 
AACAAGCTGCTGATGCCCCCTCTCTGGAACAGGACCTTCAGCTGGGGCTTTGATGACTTCAGTTGCTGCCTGGGGAGCTA 
CGGCTCTGACAAGATCCTGATGACCTTTGAGAACCTGGCTGCCTGGGGTCCCTGTCTGTGCGCTGTATGCCCTTCCCCCA 
CGATGATCGTCACATCCGGGGCCAGCGCAGTGGTGTGCATCTGGGAGCTGAGCCTGGTCAAAGGTCGCCCGAGAGGTCTG 
AAACT CC GACAGG CCTT GTAT GGACACACT CAGGCGGT CACAT GT CTGACAG C CT CTGTCAC CTT CAG CCT CCTG GTGAG 
CGGATCCCAGGATCGC1ACTTGTATCCTGTGGGACCTGGACCACCTCTCTCGTGTGGCCTGCCTGCCTGTCCACCGGGAAG 
GCATCTCAGCCATTGCCATCAGTGATGTCTCGGGAACCATTGTCTCCTGTGCCGGAGCCCACTTGTCCCTGTGGAATGTC 
AATGGACAGCCCCTGGCCAGTATTACCACAGCCTGGGGCCCAGAAGGAACCATAACGTGCTGCTGCATAGTAGAGGGGCC 
AG C GT G G GATGCAAG C CAC GTGAT CAT CAC GGGGAGTAAG GACGGAATGGT TCGGATT T GGAAGACAGAGG AC GTGAAGA 
TGCCTGTTCCC^GGCAGGCAGTGATGGAGGAGCCCTCCACGGAGCCCCTAAGCCCCAGAGGTCACAAGTGGGCCAAGAAT 
CTTGCCCTGAGCCGAGAGCTGGATGTCAGTGTTGCTCTGAGTGGCAAGCCCAGCAAGGCAAGTCCTGCTGTGACAGCTCT 
GGCCATCACTAGGAACCAGAGCAAGCTCTTGGTTGGCGATGAGAAGGGCGAATCTTCTGCTGGTCTGCTGATGGGTAGGA 
GACAAGGAGCTGGAGGGACCGACCTGAAAGCCTGGAGCCCTGGGGTCGGCAGCAACAGGCTACAGGCACAAGATGATGTG 
TAGCCTGGGCTGCTTAACCAGAGCAAGTTTTGGGGGGGCTCCACTCCACACAGTTCTCAAGGAGTCCCTGATGGTTTGCA 
C C GT GT AC C CT AAACAT GT CT GTAGT CTAT GGGACTT CTGT AAGAAGGAT CTT G GT AG AC ACT GAT GCT G GAAACT GACG 
CTGCATGGGAAATTTCTACGTGGCTCACTTCACCAAGGCTTATTGCACTGGGAAAAGAAGAGGGGGTGGGTATTGGTTCA 
TGGAAACCACCCCACTG7CTTTATTTTATTAAAACTCCATTTTCCGAAAAAAAA 



SEQ.ID. No: 47 

>SC|DD116|14045 Frame 2

TTEDPFNLRKHPGFDRTMLQRWQKREISNEXYLKyLNTIAGRTYNDYWQYPVFPWVIADY 
TSEMLNLTNPKTFRDLSKPMGAQTKERKLKFTQRFKDVEKIEGDMTVQCHYYTHYSSAII 
VAS YLVRMP P FTQAFCS LQGGS FDVADRMFHSVKSTWESAS KENMSDVRELTPEFFYLPE 
FLTNCNAVEFGCMQDGTTLGDVQLPPWADGDPRKFISLHRQALESDFVSSNLHHWIDLIF 
GYKQQGPAAVEAVNTFHPYFYGDRIDLGSITDPLIKSTILGFISNFGQVPKQIFTKPHPS 
RNTTGKNPGPGKDASTPVGLPGHSQ3 FLHSLPALRPSQVTVKDMYLFSLGSESPKGAIGH 
tVPTEKSILAVEKNKLLMPPLWNRTFSWGFDDFSCCLGSYGSDKILMTFENLAAWGPCLC 
AVC P S PTMI VTSGASAWCIWELSLVKGRPRGLKLRQALYGHTQAVTCLTASVTFSLLVS 
GSQDRTCILWDLDHLSRVACLPVHREGISAIAISDVSGTIVSCAGAHLSLVWVNGQPLAS 
I TTAWGP EGT I TCCC I VEG PAWDASHVI ITGSKDGMVRI WKTEDVKMPVP RQAVMEEPST 
E PLS P RGHKWAKNLAL5 RELDVSVALSGKPSKAS PAVTALAITRNQS KLLVGDEKGES SA 
GLLMGRRQGAGGTDLKAWS PGVGSNRLQAQDDV* 



SEQ.ID. No: 48

>SC|DD64|14046

C AG AG G CAG G CAG GAC AG G CACT G CAS G CACT GAGAG C C TT GCAGT CAG AA G G GAAG CACA5 ACAG GAAGAG GCAGT GAG 
G AG AAAAGGAGCAS AACT GAGACAAGCAAT GG AGAT C C CT GAGGAGTGCT G G CCAACG GCT GAAGT GT C C CT AAAAGACA 
TCACT GAAATAAT GGAGAGAC ATCT CAGTCACAT GGAAC GGACC CT GT CT CACAGTCAAAAGCTCT CAGATGGAGACCTG 
GT AAGAT GG G CAT CTG GAG GGCTGGT C CTGCAGGGAATTTAT AAGAC CAAC CACCCAAGAAGC CT GATT CAGAAGAGAGA 
AGAGCTACT CAGTGT C C CT AAG CAGTT CTCACTT G CT GGCCCAGT ACAT GGCACAGAGAT AAAAACAAAGGAATTTTCAT 
CTTTT CAAGAACAAGC CAT GTTTACACAGACT AT AGAGAG GGTGGGTTTCAGTT CAAC CC C CTTAGTTAAAGGAGAAGGT . 
TGGGGACTTAGT CT AGAAGCTGGT AT G GGT CACAACAAACAGACAGAAT CTGAAGAT AACT ACCAGTCCCATTCAAAGCA ^ 
AACTTATTTTTGCTCAGCCAGGTTCAGCTACATCCCATTGGCCACCTGCCATTTTCAC^ . 
AGGCTGCTCTC C^GGAACT AAAAAGT ATTGAAGAAATCCT GGAG CAGACT ACACACCAC CGAGATGGACT AC CCTTACTG 
AGGCACAGGGCTAAAAACTTTTTCCACAGGTTTGGCTCTCATGCTAACCAAGGCCCTGTGCACCTGGGGGGAATCTACT 
CTG GAAAGC CATTT CAGAAGGTTT CAAAAGT GAGCACTT GGCTGATGT AAAG CAGCAAG CAGAAGAGTCTTT GAATATTT 
ACATTATGGGCAGTTATAGTGGCTTTGGAGTTAAAGTTGGTGCGAGTGTAAATATAACAAATTCAAAATCAAAAACAGCA 
TTTTAC AGT AAAACTC ATCTAAACT C G CAAAC C AAGGT ACAACT ATCTGTAGC C AAG AT AGGT GGAC CAGC AGAAGCAGA 
T G G AAT T G C C CAGT GGACAG CTGG C C TT GT AGCT AGCAAT CAAAC CT GGT CT GTT ATT GAT AG GAAACTGCAGTT GGTAC 
CTATTTGGGACATTATCCTGTCCAGTCACAGAACTGAATTTAAGAATGCT 
TACACTGCTCTGACTGAACTAGCTGCCCAGATTCAAGAAGGGGAAGAATTTCTGACTGCT 

C CT AAAGAAT GT GAAAGG CTG GGAGGTTT CT GAT CCT GAAGAACAGCTT AGGAAGTT AGT AGATTTT AT GCAAACATT GA 
GTCAAAAAATAAAAAGTTATGACATTTGGATTAAGACATGCCT 

GT CAACTT CT GCAAAAATT CACCCACTTATAAAACT CACTTTATTAAAT CT CAGTT GTG CAGCCTTCTAGAAC CTCAT GT 
C T ACAAAGT GACAAACTT T C CT CAG GCACAAT G€AT CAT ACAGT G GAT CAAT CAGT CAGAGT CAGAG GAAGAACT AGT CA 
AAATT ACCT CATTTTCT GAATTCATT AACACCTT AAAGAAAACCCACAAATAC CTAAT GGAAGAGAGTTTCAAAACTGAG 
C C C CCAGAAAGAGT GGAAGAAGCAAAGAGAAT GGCTACAT ATGAAGT CACCACAG CTCTCAGCT C CTTCTTGAAGTACCT 
C AGAGAAACACAG CAG C CAGACAT GCAG CTGTT GCTACT CTCCATTGCT ACT GGTGTAGGCTAT CAGT T GGTAAACAGTA 
TTTTT C AGCAT CTT CT G GGGTGT GAT GAGTTAAACTT C CT CTT G GAT CAAATGGAAAATAAC GAAC AT AAAT ACCAAGAA 
CT GAAAAAT ATTT G CAATT ACAGAG C C CAG GCATT CTTGGTGCT CACAGCC CTAAGAGCCACAGTT GAAAT CACAGAT GT 
TT CTACAGAAGAGAAAGGACAAC GTTT GACATTAAT ACAACAACAT AT GGGGTCACT GTTGT CT GAAGAAGTTGCACAT G 
T C CT CACAAAACAT G GAGAACAT CAT GACTGGGAAAGG CT GGAGAATGATTT GAGATT ACT CATT GAGGGGGACT ATAAA 
GC CACCAC C CAT T C CTTACAAAT GGAT GAAGT AAAAAAACAATTGCAAAGTTTNTGC CATGAAAAGAAACAGACTTATAA 
AC AAC AAGGT AAT GAAAACAGAACAAAAGAAAT GATAGAAAATGGACATTT CCTGGACTTACTCCAACGTTTAGGCCTAG 
AC AATTACT AT C CAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



SEQ.ID. No: 49

>SC|DD64|14046 Frame 1

QRQAGaAIJC^RAtQSEGKHXQEEAV^lRKGAXI i RQAMEIeEECWPTAEVSLKDITEI^ER 
HLSHMERTLSHSQKtSDGDLVRWASGGLVLQGrYKTNHPRSLIQKREELLSVPKQFSIAG 
PVHGTEIKTKEFSSFQEQAMFTQTIERVGFSSTPLVKGEGWGLSLEAGMGHNKQTESEDN 
YQSHSKQTYFCSARFSYIPLATCHFHINDLELSQAALQELKSIEEILEQTTHHRDGLPLL 
RHRAKNFFHRFGSHANQGPVHLGGrYCWKAISEGFKSEHLADVKQQAEESLNrYlMGSYS 
GFGVKVGASVNITNSKSKTAFYSKTHIlJSQTKVQLSVAKIGGPAEADGrAQWTAGLVASN 
QTWSVIDRKLQLVPIWDIILSSHRTEFKNALQLANCLKDHYTALTELAAQIQEGEEFLTA 

RKEAKLFLjOTVKGWEVS d peeqlrklvdfmqtlsqki ks ydiwintcli dwdlqnflini 

VNFCKNSPTYKTHFIKSQLCSLLEPHVYKVTNFPQAQSIIQWINQSESEEELVKITSFSE 
cINTLKKTHXYUMEESFKTEPPERVEE^RMATYEVTTALSSFLKYLRETQQPDMQLLLL 
SIATGVGYQLVKSIFQHLLGCDELNFLLDQMENNEHKYQELKNICNYRAQAFLVLTALRA 
TVEI T DVSTEEKGQRLTLIQQHMGS LLS EEVAHVLTKHGEHHDWERLENDLRLLI EGDYK 
ATTH S LQMDEVKKQLQ SXCHEKKQT YKQQGNENRTKEMI EKGH FLDLLQRLGLDNYYP 



SEQ ID. No: 50 

>SC|19|14042

TCTTCGAAAGCCGGGCTGAGGGGAATCCTGGACAGGGGAATCCTGGACGTGGAGATCGTGAGTCATGGCTGCTTCCCGAG 
ACGCTG ATGAGATCCACAAGGACGTT CAGAACTACTATGGGAATGTACTGAAGACATCTGCAGACCT CCAGACTAATGCT 
TGTGTCACGCGAGCCAAGCCGGTCCCCAGCTACATCCGGGAAAGTCTGCAGAATGTACACGAAGACGTTAGTTCGAGGTA 
TTATGGCTGTGGTCTGACTGTTCCTGAGCGGCTGGAAAACTGCCGAATTTTGGATCTGGGTAGTGGGAGTGGCAGGGATT 
G CTAT GT G CTT AGC CAG CT G GTTG GT GAGAAGGGACATGTCACCGGAATAGACAT GACTGAGGT C CAGGT C GAAGT GGCT 
AAAACCTATCTTGAACACCACATGGAAAAATTTGGTTTCCAGGCACCCAATGTGAC^^ : 
GTT GG CAGAGGCTGGGAT CCAGAGTGAGAGCTATGGTATTGTCATAT C CAACT GT GTTAT CAACCTTGTT CCT GATAAAC 
AACAAGTCCTCCAGGAGGTCTATCGAGTGCTGAAGCACGGCGGGGAGCTCTATTTCAGTGACGTCTATGCCAGCCTTGAA - 
GTGCCAGAAGACATCAAGTCGCACAAAGTTTTATGGGGGGAATGCCTGGGAGGCGCTCTGTACTGGAAGGATCTTGCCA 
CATTGCCCAAAAGATTGGGTTCTGCCCTCCACGTTTGGTCACTGCCGATATCATTACTGTTGAAAACAAGGAGCTCGAAG 
GGGTTCTTGGTGACTGTC3CTTTGTGTCTGCCACATTTCGCCTCT 

C G AGTTGTTTACAAT G GAG GAATCAAG GGACAT GAAAAGGAACT AATTTT C GATGCAAATTT CACATT CAAGGAAG GC GA 
AGCTGTTGCAGTGGATGAGGAGACGGCAGCTGTCCTGAAGAACTCACGTTTTGCTCCGGATTTTCTCTTCACACCTGTTG 
AC G C C T CG CTG CCAGCT C C C CAGGG G C C GTT CT GAGTTAGAGACAAAGGTTCT AAT C AGAGATC C ATT CAAGCTT GCAGA 
GGACTCTGACAAGATGAAGCCCAGACATGCACCTGAAGGCACGGGAGGCTGCTGTGGCAAGAGGAAAAA 
ACAGCCAGCGCGGLAGCCCACCGGGCTCAAGAGGGTGGCTAAAGGACAGTCACAGA 

T G CACAGAT TAT G T GAAG GT G G CAAAG C CAC CACAAG CT AGAC CACT G CT AAGAAT AAGAGT GACTTTT AGAG GAT GTTA 
ATT GAAGGT T CACAG CAAAT C GCCT GCTT TTCTATTTCT CTAT CT CAGAGTTCT G GTG CCACCT AGTGGTCAGAAGTAGA 
ACTTGGAAGCCCAAGGTTTACTCAAAGGGCCAAAGGCATCATCAACGTTGTGAGAATTATCTTCCTTCTGGCCTACC^ 
GGACACCTCTGGGTTCTTCTCTGTGGTTACCAGGAAGCACAGTACTTACTAAATTTATGCTAACCATGACAAAAGATTGT 
CAACT CAAATTT GCTAG GAGT ATT CTTT AGGTTGCTGT CT GCAATTTTTTT CTCT GT AACT GAATGAAAAAGAAAACAAA 
TAAAAAATAAATTTGACTTCGAAAAAAAAAAA 



SEQ ID. No: 51

>SC|19|14042 Frame 2
MAAS RDAJDE I HKDVQNWGNVLl 
VPSYIRESLQNVHEDVSSRYYGC^ 
TGI DMT EVQVEVAKT YLEHHMEH 
NLVPDKQQVLQEVYRVLKHGGEI 
I AQKI G FC P P RLVTAD 1 1 TVENj 
I KGHEKELI FDANFT FKEGEAV* 



• SAD LQTNACVT RAKP 

ILTVPERLENCRILDLGSGSGRDCYVLSQLVGEKGHV 
*G FQAPNVT FLHGRI EKLAEAGI QS ES VGI VI 5NCVT 
\ FS DVYAS LEVP ED I KS HKVLWGECLGGAL YWKDIAI 
XEGVLGDCRFVSATFRLFKLPKTEPAERCRWYNGG 
I EETAAVLKNS RFAP D FL FT P VDAS LP AP QG P F* 



SEQ.ID. No: 52 

agcccccgagaagacttccgctt^S^ 

C ^CCACAATGCCCCACAoJ^icTcrr^^^ CAGAACA CCAGCCTGCCTGGGGCTCCGAGCTTCATCTTCT 

GTGGTGCASMCCTGCGGCTTC S^C^ScSSSSJ^;? 11 "''" 50 " 3101 " 5 
TGGCCTCCCCTGGGCCTTGGTC CTTCCO«cSSSS55S2J K °' :C,CAGCCTOSTCCT 

AAGAACAACTCAGACAGCGCCA «OCOkS5Sc5cSSSc?SS2 "°»"CTG 



rGCCTTGG 



GCTGGGTGTTGAGACCTGCAGACCCAAGCTGGGGTTA 



GTCTTTCTGGCCACCTCTGGG GCTGGGTG^GAGACCTG^Garrraa^n^^ 
AGGAGGCTGACACATCCGGCCTCT G TC^I^^"^^E5^ CTGGGGT TAGATCTCGA 



GCTTTGGCCTCT 
ACCTCCATCTCT 
AAGACAGCCCCC 

GAAGACACTGGGCTGCCTAGTGfACTAACGTACAOTrArrar-r-i^ 

TCTCACCAAGTCCT SGAGTGTC «SmSSS^S5S^ CTCCTCACCCCCAC CCCCGCAAAAGC 
CTCAGCCTTCAGGTTGGGGAGA ATA^CACTTC^TCTG^CTT^rar^^^^^^^^^^^^^^^ 
TCTGGAAACTG.AGGCGATTGX ^S^^^S^S^^ 



^CAAAAA 



SEQ. ID. No: 53

>SC|28LONG|14041 Frame 2
MAVQVLRQMVYFL

LSLFSLVQGAHSGSPREDFRFCGQRNQTQQSTLHYDQSSEPHIEVWNTEETLTIRAPFLA 
APDIPRFFPEPRGLYHFCLYWSRHTGRLHLRYGKHDYLLSSQASRLLCFQKQEQSLKQGA 
PLIATSVSS«QIPQMTSLPGAPSFIFSFHNAPHKVSHNASVDMCDLKKELQQLSRYLQHP 
QKAAKRPTAAFISQQLQSLESKLTSVSFLGDTLSFEEDRVNATVWKLPPTAGLEDLHIHS 
QKEEEQSEVQAYSLLLPRAVFQQTRGRRRDDAKRLL\A^DFSSQALFQDKNSSQVLGEKVL 
GIWQNTKVTHLSDPVVXTFQHQPQPKNVTLQCVFWVEDPASSSTGSWSSAGCETVSRDT 
QTSCLCNHLTYFAVLMVSSTEVEATHKHYLTLLSYVGCVISALACVFTIAAYLCS RRKSR 
D YT I KVHMN L LS AVFLLD VS FLLS EPVALTGS EAACRT SAMFLHFS LLACLSWMGL EGYN 
LYRLWEVFGTYVPGYLLKLS I VGWGFPVFLVTLVALVDVNN YGPI I LAVRRTP ERVTYP 
SMCWI RDS LVS YVTN LGLFSLVFL FNLAMLATMWQ I LRLRPHSQNWP HVLT LLGLS LVL 
GL P WALVF FS FAS GT FQ LVT L YL FS 1 1 T S YQG FL I FLWYWS MRFQAQGGP S P L KNN S DSA 
KLPISSGSTSSSRI* 



SEQ.ID. No: 54 

CTCAGGGAAGGCCGATC7TCCGGGTGGAGGGGGAAGCGGCGTGACTGGAGTGGAAATTTTTCCCAACACAACTTCTCGGA 
^f^^ ATATTG ^ GG ^ CTCGGG ^ GGCC CGACTCCAAATGGAACT 

GAG ^^ CGCGTCGTGCCGCGCGCCCTCAATTACCTCATCGACCT 
^^ GGAGACTGGTTCGGGATGTTTGACGGCTATGATA ^ 

A f^^ G ^^ ATGGCTACCTGTGAGATGTGTGGGATGGTC 

r AG I^TII^ CTGTTCAAGAAGTTACTCGTCAAAC 

CAAAGAAAGCAAAAGTC ^ 

;rI C ^ GCAAAGACAAA ^ 

^IJ CA ™ CTCCAGTGGCCTGTTTTAAGCATGCACCTATG 

J AG . AAG J3 ^ CAATACAGACTG CAGTCTAC CTAC CAAAGT CTTCT GGATTG CT G GAATTAT AAAATT AGCAGGTTATAAT 

GGG 5ZT^ GAGATATGAAGGAT ^ GA ^ 

AGTTGGTTGGTGTGC^GCTACTG^AAAACCTCTCGTTCCTCCTAGAACTGTTCAACA^ 

T T CT AGT AAAAAGACTTACTGGT GCCAAAACACTT C CTC CTGATTT CT CACAAAAGGTTT CT GAGAGTATGCAATATCCT " 
TT CAAAC CTT G CAT GA GAGTAGAAGTAGTTGACAAGAGGCAT TTATGTC GAACAAGAGTAGCAGTGGTGGAAAGT GTAAT ' 
T G GAG G AC GACTAC G GCT GGT GTAT GAAGAGAGTGAAGAT GGAACAGAC GACTT CT GGT G C CACAT GCACAGCC CCTTAA 
T C CAC CAT AT T GGAT G GTCAAGAAG CAT AG GC CATC GATTCAAGAGAT CAGAT ATT AC GAAGAAACAGGAC GGACATTTC 
GATACACCTCCACACTTATTTGCTAAGGTAAAAGAAGTAGACCAGAGTGGAGAATGGTTCAAAGAAGG^ 
AGCTATAGACCCATTAAATCTTTCTACAATATGTGTTGCCACCATTAGAAAGGTGCTGGCTGATGGATTCCTGATGATTG 
GGATTGATGGCTCAGAAGCAGCAGATGGATCTGACTGGTTCTGTTATCATGCAACCTCTCCTTCCATTTTCCCTGTGGGT 
TT C TGTGAAATT AAC AT GATAGAACT GACTCCACC CAGAGGTT ACACAAAACTT CCTTTT AAAT GGTTT GACT ACCT CAG 
G G AAAC C GGCTCCATT G CAGCAC CAGTAAAACTATTT AATAAGGAT GTT C CAAAC CAC GGATTC CGT GTAGGAATGAAAT 
TAGAAGCTGTAGATCT CAT GGAGC CACGGTTAATATGTGTAGCCACAGTTACTCGAAT T ATT CACCGTCTCTTGAGGATA 

C^TTTTGATGGTTGGGAAGAAGAGTATGACCAGTGGGTAGACTGTGAGTCCCCCGACCTCAATCCTGTAGGGTGGTGTCA 
GT T AACT G GAT AT CAA C T ACAG C CT C CAG CAT CACAGT CAT CAAGAGAAAGC CAAT CAG CT T CTT CAAAACAG AAGAAAA 

^GCAATACAAAGGACATAAGAAAATGACCACGTCACAGCTGAAGGAG GAGCT GCTGGACGG GGAGGAC 
T AT AGCTT C CT C CATG GAGCAT C TGACCAGGAAAGCAAT GGCTCTGCCACCGTCT ACAT CAAACAAGAGCCAT GAG GCGA 
CTCGGAAACCATGGGCAGGCGGGGCTGTTTACAGGACTGATTTGGAATCAGCCAGCTGTATAGCGGGCTATTCTACTGGG 
ACATTTTGCTAAACACAGAAAAA^GTTCAGTTCCAGATTTTTCAGGTGGGGGGGAAACTATTTTGGTGGGGGGGCAATTT 
TT C AATTTAT AAAGAC G GACAATT TT TGTGTT GT ATTT GAAGCTTTT GAAAGAATTTTGTAATATTTT CCAAGTTT GGAT 

IZ A T GTGCATTGTTAACAAGAACTGAAATTATAACT 

ATTAAGAAGGATATAGTT GT AAAT G CACATGAACT GT CATT ACAAAT GAAC CTTCTTG GT AC CTGTTGG GAGATTTTTT G 
GATTTT AG AGTT AGG C CAGTCACAT CTC CAG CTT CCTTTGCT GCGAAAATAT GCAACT GAAC CC CT CACAGAGGGCTCAT 
CAGCATGTAGATCACGGAAG ^ 

CTTTTTCATTCTGCTAAAATTTATTTGGAGTTGTATATGAAACTAGGAGAACTCTGGATACTTTTATATGCTCTCCTTCA 
C G T AAAGAT G ATTAAAATTGTCT AAC ACTAAAGT GTTAAACTG GAAT G GTT GACAAAGT AC ACAAGAT CT CAGT CT ACAT 

AAAGGGTTGGGGGAAATTAGTATTTTCCTAAGTTTATTCTGTTTTCCTTGCTAGAAAAATCCCATAAAAGGGATATTCTT 
G : CAGACCTCTGCCATTTCTTCCATCTGTGAGAGAGCAACGTGATGTAACCACACCTGAGAGCAGGTACTGTCCTGTGTT 
As- A^AAG AAT CACAT G G AAAATT G T T AAC CAAAVTAAGT GT AGATT T CAGAATC ATT C CAG CTC TACT CTT AAC T GT CCT 
G AATTT GTT AGAAACT G ATTT GA-AG AAAT GT TT CTTAAAATT C CAT ACAC ACACAC ACAC AT ATT AT GAATT GT CT TT A 
tt- r?*^ AG ^ G AGAA ^"^^ A ^ ^ A ^ TT G ATTTGGACCATTTTAAACATT C C C TATTTT AAAATT CAT AC G GCTTC C 

T.AAGAAGTAGAATGA^AGGGTA^ATTGGTGACATGTTTGCTCTCTGCATTTTCTAACCTTCTACCGAATTGTGACTGAC 
1^ A ? GG ^ G ^ AG CATT TAC CAG T G AGGTT CAGAAACT AAACTCT CAG GAATT CAT T G CAT CT GTTGTAGAAGT GCTTC 
T^GGTCTGAGCCTGGCCTCCTC^GAGTGGTAATACTGCCCACTTCCTCTGGAAAATAGGCAGGGCTAATGAGAA^ 
CAAATGATTAACCTCTGCGGCCCTTCAGCCTTTGGAATGCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



SEQ.ID. No: 55

>SC|sa49p01-full.seq|14351 Frame 2

MFDGYDSCSEDTSSSSSSEESEEEVAPLPSNLPI f(iwncTr _ _ 

IKNNCKlVTTYPDGKSC^TCEWCGMVG^AFVSICrKRFCSVSCSRSYSSNSKKASr^ 
WGKPPTKKAKVLQKQPLN^IJVAYAQYQATLQNQAKTKAGNSAISVEGFSWGNYINSNS 
FIAAPVACFKHAPMGTCWGDISENVRIEVPNTOCSLPTKVFWIAGIIKIAGYNALIiRYEG 
FENDSSLDFWCNICGSDIHPVGWCAASGKPLVPPRWQHKYT^KAFLVKRLTG^LPP 
DFSQKVSESMQYPFKPCMRVEVVDKRHLCRTRVAVVESV1GGRLRLVYEESEDGTDDFWC 
H>1HSPLIHHIGWSRSIGHRFKRSDITKKQDGHFDTPPHLFAK\^E^QSGEWFKEGWLE 

aidplnlsticvatirkvi^gf™igidgsej^ 

ELTP PRGYTKLPFKWFDYLRSTGS IAAPVKLFNKDVPNHGFRVGMKLEAVDLMEPRLICV 
ATVTRIIHRLLRIHFDGWEEEYDQVfVDCESPDLNPVGWCQLTGYQLQPPASQSSRESQSA 
SSKQKKKAKSQQYKGHKKXTTSQLKEELLDGEDYSFLHGASDQESNGSATVYIKQEP* 



SEQ ID. No: 56 

>SC|sa61|14347

GG GGCTTCACTCTCCAGCCTTGGAGAGCAGTCAGGGAAGGCCTCCAGGAGGAAGCAGCTACCTGTGCCAGGGCAGAGCAG 
TGCCTGTACCAGTCTCCCGGAACTTCAG^CTGCGGTGGCCATGGAGTCTGTGGAGCCCCCGGTCAAAGACGGCATCCTC 
TACC^GCAGCACGTAAAGTTTGGCAAGAAATGCTGGCGCAAAGTGTGGGCTCTGCTGTATGCGGGAGG 

AGCTCGGCTAGAAAGCTGGGACGTGCGTGATGGTGGCCTGGGACCAGCAGGCGACAGGTCCACAGGGCCCAGCCGTCGAG 
GGGAACGCCGGGTCATACGCTTGGCTGACTGTGTATCTGTCCTGCCTGCGGATGGCGAGAGCTGTCCCAGGGACACTGGT 
GCCTTCCTGATTACCACCACTGAGCGAAGCCACCTGTTGGCTGCACAGCACCGCCAGTCCTGGGTGGACCCCATCTGTCA 
GCTGGCCTTCCCGGGTACCGGAGAA7GTTCGTCAGGATCAGGACAGGCTGAGAATCCAAAAAGGGGCTTTGTTCCCATGG 

%£S£^S£g™ig2s%g™zg*»-«<«^ 

CAGCGGGAGCGTCTTCCAGAGCTCGCCATGTCCCCACCCTGCC 



* » ^CCTACCATTGCTGCTCAGCCCCAACACAAGAAGGACCGGCATC. w 4WV<mi ^ T[ ( -.^ B 

'GAi 
GA 

'CO 



CCAATGGGGGTC 
GGATCTGAGTTG 
AGGCCGGAAGCG 
AAGGGCCCCGCC 
GAACAACTTCGA 

AWCTGTACA^TTGAAC^GTAAT^ 



SEQ. ID. No: 57 

>SC|sa61|14347 Frame 2
MESVEPPVKDGILYQQHVKF

GKKCWRKVWALLYAGGPSG^/ARLKWDVRDGGLGPAGDRSTGPSRRGERRVIRLADCVSV 
LPADGESCPRDTGAFLITTTERStiLAAQHRQSWVDPICQLAFPGTGECSSGSGQAENPK 
RGFVPMEENSIYSSWQEVTEFPVITQKTEATSRCQLKGPYLLVLGQDDIQLRETSKPQAC 

fsmpyrelricygsdkg\^sfeagp|cdsgeglfafssprapdrcgvvaaaiarqrerlpe 
ijwsppcplpralslpsleppgeiIevapgfelptprklpltdpgpqslplllspntrrt 

GIRS LCVRVQADQQAHRHGGAFL jRVHAGGQPWADQWGS* 



SEQ ID. No: 58 

GGCCTCCGACATATTGCCCGCAGGAGCTGCGGCGGTGAGCGGAGAGCGCCGGGGGAAGGAGATGGGAGGACGAAGAGGTC 
CGAACAGGACATCTTACTATCGAAACCCGCTCTGTGAGCCAGGATCATCAGGGGCCTCTGGTGGAG^ 
GCATCCGTGAGCAGCGTCCGTTCTCGAAGCAGGACCACTTCTGGGACGGGCCTCTCCAGCCCTCCGCTGGCCGCGCAGAC - 
CGTTGTGCCTCTACAGCACTGCAAGATCCCCGAGCTGCCCGTCCAGGCCAGCATTCTGTTCGAGCTACAGCTCTTCTTCT * 
GTCAGCTCATAGCCCTGTTCGTGCACTACATCAACATCTATAAGACAGTCTGGTGGTATCCACCCTCGCACCCACCCTCC 
CACACTTCCCTGAACTTCCACCTGATCGACTTCAACTTGCTGATGGTGACCGCCATTGT^ 

GTCCATCGTGAAGGAGGCTTCTCAGAGGGGGAAAGGTCTCCCTCTTCCGCTCCATCCTGCTGTTCCTCACCCGCTTCACG 
TTCTCACGGCGACAGGCTGGAGTCTGTGCCGGTCCCTCATCCACCTCTTCAGGACCTACTCCTTCCTGAACCTCCTGTTC 
CTCTGCTATCCGTTTGGGATGTACATTCCGTTCCTGCAGCT^AACTATGATCTTCGCAAGACGAACCTCTTCACCCACAT 
GGCTTCCATGGGACCCCGAGAGGCAGTCAGXGGCCTGGCAAGGAGTCGGGACTACTTTCTGACACTGCGGGAGACTTGGA 
AGCAGCATACACGACAGCTATATGGCCCGGAAGCCATGCCCACCCATGCCTGCTGCTTGTCACCCAGCCTCATTCGCAAT 
GAAGTT GAGTT CCT CAAAAT GGACTT CAATTGGCGAATGAAGGAAGT ACTT GTCAGCT C CAT GCT GAGTG C CTACT AT GT 
GGCCTTT GT ACCT GT GT G GTTTGT GAAGAATACACATTACTATGACAAGC G CTGGT CCT GT GAGCT CTTC CT GCT G GT GT 
CCATCAGTACCTCCGTGATCCTCATGCAGCACTTGTTGCCTGCCAGCTACTGTGACCT 

CTGGCGTGCTGGCAGAAGGTGGACCCAGCTCTGTGCTCCAATGTGCTGCAGCCACCGTGGACTGAAGAGTGCATGTGGCC 

ACAGGGTGTGCTGGTGAAACACAGCAAGAATGTCTACAAAGCAGTGGGCCACTACA^ 

CCACTTCCGCTTCCATTTCTTTTTCAGCAACCCCCTGCGGATCCTCAACATCCTTTTG 

TCT ACCAGCT GTACTCCTTAAT GT CCT CAGAGAAGTGGCAC CAGACCAT CT CC CT G GCCCTCAT C CTCTT CAGCAACTAC 
TATGCCTTCTTCAAGCTGCTCCGGGACCGCCTGGTATTGGGCAAAGCTACTCCTACTCAGCCAGCCCCCAGAGGACCTGG 

ACCACCGGTTCTCCTGAGTCCCAGGGTGATCCTCCAAPiCAACATCC^^ 

TT G G GGTTG GGG G GG AGGGATACAAAAAACAAAAACAAAAAACAACAAAAAAAAACC AACCCAC CAGAGCTTTGTATTTT 
T GTT AC GTACT GTTT CTTT CTTTGAT AATT GATGTGGTAAGAAAAAAGTCTT ATTTTT ATACT C CAAAAA 



SEQ. ID. No: 59 

>SC|smc34|14348 Frame 2

MGGRRGPNRTSYYRNPLCEPGSSGASGGGHSSSASVSSVR 

S RS RTT5GTGLS S P PLAAQTWPLQHCKI P ELPVQAS I LFELQLFFCQLIALFVHYINI Y 
KTVWWYPPSHPPSHTSI^FHLIDFNLIWTAIVLGRRFIGSIVKEASQRGKGLPLPLHPA 

VPHPLHVLTATGWSLCRSLIHLFRTYSFLNLLFLCYPrGMYIPFLQLNYDLRKTNLFTHM 
ASMGPREAVSGLARSRDYFLTLRETWKQHTRQLYGPEAMPTHACCLSPSLIRNEVEFLKM 
D FNWRMKEVXVS SMLSAYYVAJVP WFVKNTH YYDKRWSCELFLLVS I STSVI LMQKLLP 
AS YCDLLHKAAAHlACWQKV^PALCSNVTiQP PWTEECMWPQGVLVKHS KNVYKAVGHYNV 
ASPPMSPTSASISFSATPCGSSTSFCCWRVPSLSTSCTP* 



SEQ ID. No: 60 

>SC|AA4.1|15737

GTGAAAGCAGCAGTGCGCCTCTGCTCCCTTCAGAGCACAGCCTGGTGTCAAGGTCCAGGTTCCACCGGCTGCTGCTGTCA 

CCGCAGGGGAGTCTAGCCCCTCCCAGAAGGAGACACAGAAGAATGGCCATCTCAACTGGTTTGTTCCTGCTGCTGGGGCT 

CCTTGGCCAGCCCTGGGCAGGGGCTGCTGCTGATTCACAGGCTGTGGTGTGCGAGGGGACTGCCTGCTATACAGCCCATT 

GGGGCAAGCTGAGTGCCGCTGAAGCCCAGCATCGCTGCAATGAGAATGGAGGCAATCTTGCCACCGT 

GAGGCCCGGCATGTTCAGCAAGCCCTGACTCAGCTCCTGAAGACCA^ 

GATCGGGCTCCAGCGAGAGAAGGGCAACTGTACGTACCATGATTTGCCAATGAGGGGCTTCAGCTGGGTGGGTGGTGGAG 

AGGACACAGCTTATTCAAACTGGTACAAAGCCAGCAAGAGCTCCTGTATCTTTAAACGCTGTGTGTCCCTCATACTGGAC 

CTGTCCTTGACACCTCACCCCAGCCATCTGCCCAAGTGGCATGAGAGTCCCTGTGGGACCCCCGAAGCTCCAGGTAACAG 

CATTGAAGGTTTCCTGTGCAAGTTCAACTTCAAAGGCATGTGTAGGCCACTGGCGCTGGGTGGTCCAGGGCGGGTGACCT 

ATACCACCCCTTTCCAGGCCACTACCTCCTCTCTGGAGGCTGTGCCTTTTGCCTCTGTAGCCAATGTAGCTTGTGGGGAT 

GAAG CTAAGAGTGAAAC CCACTATTT CCTAT GCAATGAAAAGACTCCAGGAAT ATT T CACT G G GGCAGCT CAG GCCCACT , 

CTGTGTCAGCCCCAAGTTTGGTTGCAGTTTCAACAACGGGGGCTGCCAGCAGGATTGCTTCGAAGGTGGCGATGGCTCCT T 

TCCGCTGCGGCTGCCGGCCTGGATTTCGACTGCTGGATGATCTAGTAACTTGTGCCTCCAGGAACCCCTGCAGCTCAAAC [ 

CCATGCACAGGAGGT GGCATGTGCCATTCTGTACCACTCAGTGAAAACTACACTT GCC GTTGTCCCAGCGGCTACCAGCT 

GGACT CTAGC CAAGT GCACTGT GTGGAT ATAGAT GAGT GC CAGGACT CCCCCTGTGCC CAGGATT GT GT CAACACT CTAG 

GGAGCTTCCACTGTGAATGTTGGGTTGGTTACCAACCCAGTGGCCCCAAGGAAGAGGCCTGTGAAGATGTGGATGAGTGT 

GCAGCTGCCAACTCGCCCTGTGCCCAAGGCTGCATCAACACTGATGGCTCTTTCTACTGCTCCTGTAAAGAGGGCTATAT 

TGTGTCTGGGGAAGACAGTACCCAGTGTGAGGATATAGATGAGTGTTCGGACGCAAGGGGCAATCCATGTGATTCCCTGT 

GCTTCAACACAGATGGTTCCTTCAGGTGTGGCTGCCCGCCAGGCTGGGAGCTGGCTCCCAATGGGGTCTTTTGTAGCAGG 

GG C ACT GTGTTTT CT GAACT AC CAGC CAGGCCTCC C CAAAAGGAAG ACAACGAT GAC AGAAAGGAGAGTACT ATG CCTCC 

TACTGAAATGCCCAGTTCTCCTAGTGGCTCTAAGGATGTCTCCAACAGAGCACAGACAACAGGTCTCTTC 

ATATTCCCACTGC CT CT GTT C CACTAGAAATAGAAAT CCCTAGTGAAGTAT CT GATGTCT GGTTC GAGTTGG G CACATAC 

CT CC CCACGACCT CC GG C CACAGCAAGC CGACACATGAAGATT CT GTGT CT GCACACAGTGACAC CGAT GGGCAGAAC CT 

GCTTCTGTTTTACATCCTGGGGACG<5TGGTGGCCATCTCACTCTTGCTGGTGCTGGCCCTAGGGATTCTCATTTATCATA 

AACGGAGAGCCAAGAAGGAGGAGATAAAAGAGAAGAAGCCTC^GAATGCAGCCGACAGCT 

GCAGAGAGCCAAGCCCCGGAGAATCAGTACAGCCCAACACC^GG 

CTGCCACTACCTTCAGAGCTACCTTCTTAGATGAGGGGGAAGCCACATCATTCTGAATGACTTGACTGGACTCTCAGCAA 
AAAAATTGTGCACCTTCCACTTAAGAACCTGGTGGCTTGGGATAGGCAGGTATTTTCTTGGTGCCTTTGATATGTCTGGG 
GGT GAAAGCT GT GT GTT G GTTTGT CATTGT G GG GAGTTTT GT GGATAT T GACAGACCT CACTC AAACAC C CTTTT CAAAT 
CCAATAGCAACTGGTTCCTCTGGTTCCTAATTAGGGGGAAAGGAGTCAGAGGGGTGGGACAGGGTGGGGGGATGGGGCTT 
CAAAGTTTTTTCTTATCACTTGATTTATCATCGAAGGAGTTACTGGTGCTAATTACAATGGAAACAGTTCCTTTCCATCA 
CAGGACAGACACACCT CAATCCTC CAT GGGGTCAACAACT ATATACC C C C AGT GACCC CT T AGG CAAGGACTTGTT GAGA 
ACTGCATCJVCATTTTGACCTGTTCTCAACAGTACCCATCTATTTCAGGTGGGATCTCTGGACCTTTCCTCCTTCCCATCT 
TGTCTGCAATGTGGCAAATGGCTTCTTTTTGCATTTTTACTCCGCCCCCACCCCAAGCTGAAGTTCATTTGCAGATCAGC 
GATTAAGTCTGAATTGTGTGGTGGTCAGTCTTGTTTCCTTTTGTCAGGGGTTATTGTAAATGTTAGTAATTTCGCCTCAA 
GCCCTCAGTAAGAACATAAATATTTTAAAATATGTGCGTTTGAAATCTGTTTCATGCATCCTGGAACTGTGGGATGCTCA 
GGCAAGAGTGACTTTAGTCTTTCAGTGAATGTTGCCCAGAATGTGGGTAGGGAAGGCTCACAGGTTACTCTCCTCCTTAG 
AGCTACAACATAACATTCTGAGGGGAGTCACAGGGTTGCCTTTAAAAAGTGGGAGCTATGTCATGCTTTGAGCTTTCTGT 
TAAGCACCTCTCCTAATAAACTCT GAAAAAA 



SEQ. ID. No: 61

>SC|AA4.1|15737 Frame 3
MAI STGLFLLLGLLGQPWAG 

AAADSQAWC EGTACYTAHWGKLS AAEAQHRCNENGGNIATVKS EE EARHVQQAI/TQLLK 
TKAPLEAKMGKFWIGLQREKGNCTYHDLPMRGFSWVGGGEDTAVSNWYKASKSSCIFKRC 
VSLILDLSLTPHPSHLPKWHESECGTPEAPGNSIEGFLCKFMFKGMCRPLALGGPGRVTY 
TTPFQATTSSLEAVPFASVAMVACGDEAKSETHYFLCNEKTPGIFHWGSSGPLCVSPKFG 
CSFNNGGCQQDCFEGGDGSFRCGCRPGFRLLDDLVTCASRNPCSSNPCTGGGMCHSVPLS 

ENYTCRCPSGYQLDSSQVHCVDIDECQDSPCAQDCVNTLGSFHCECWVGYQPSGPKEEAC 
E DVD ECAAANSPCAQGC INT DGSFYCSCKEGYIVSGEDSTQC EDI DECS DARGNPCDSLC 
FMTDGSFRCGCPPGWELAPNGVFCSRGTVFSELPARPPQKEDNDDRKESTMPPTEMPSSP 
SGSKDVSNRAQTTGLFVQSDIPTASVPLEIEIPSEVSDVWFELGTYLPTTSGHSKPTHED 
SVSAHSDTDGQNLLLFYILGTVVAISLLLVLALGILIYHKRRAKKEEIKEKKPQNAADSY 
SWVPERAESQAPENQYS PTPGTDC* 
>SC|HDD2|16960

CCCATTACTTCCACCTCCCATGAGGACTGCACCTGCGGCAGCGATTTTATGTGAA.CTT GGATTACT GTATGGAAAT GGAG - 
GAGTGAGAAAGT GTGGGAAT CATAAAAGAAGG CAGAAC CAAAAGACAGAGGGAT CTTGGGCCATGAGTCACAGCAAGGAA " 
AGCCTCCCACCAAACAT CT GCACT G GACTC CTACAGGAACAAGAAAT GAACTAT CGTGTTCAGCCCCTGAGACTTGGGCT . 
TAACATGATTTTCACATTGTCTACCCTACCTAATAGAGCAGAGATGTAAATATTATTCTTATTTTAGAGGTGTGATG 
CAGCTGCAATGGGTGAGAACTACTCCTQVrTTATTCTCTT^ 

T CACAT CT AGT CAAAAGAGCT AAT GT C GCAGTACAACT CTT CAAAAGAAAAAGAAAAAACAAGAAAAAAGT AAT AAACAG 
ATGTGTTCTGCTTGAAAAA 
TGLLQEQE.MNYRVQPLRLGLNMIFTI.STLPNBAEM*



SEQ.ID. No: 64 
ACGGAGACAGCTGCAGCGGGGACAGTATTCAGGACAGCTACACCGGCATGGAAAACTCCGACAAGGACGCCATGAACAGC
CA.^TTTGCTAATGAAGATGCCGAAAGTCAGAAGTTCCTGACAAATGGGTTTTTAGGGAAGAAGAAGCTAGCCGATTACGC 
GGATGAGCATCACCCTGGAATGACTTCCTTTGGAATGTCCTCATTTAACCTGAGCAACGCCATCATGGGCAGTGGGATCT 
TAGGCTTGTCCTATGCCATGGCCAACACCGGGATCATCCTTTTTATAATCATGCTGCTTACTGTGGCAATACTCTCGCTC 
TACTCGGTTCACCTTTTGCTGAAGACAGCCAAGGAAGGAGGGTCTCTAATCTATGAAAAATTGGGCGAGAAAGCATTTGG 
ATGGCCTGGGAAAATTGGAGCCTTCATCTCTATTACAATGCAGAACATTGGAGCCATGTCAAGCTACCTCTTCATCATTA 
AGT AC GAACTGCCT GAAGTAATCAGAGCATTCATGGGACTT GAAGAAAACACTGGGGAATGGT AC CT CAACGGC AACTAC 
CTCGTCTTATTTGTGTCCGTGGGGATCATCCTCCCGCTCTCTCTCCTTAAAAATTTAGGCTACCTTGGCTACACCAGTGG 
ATTTTCTCTCTCCTGCATGGTGTTTTTCGTCAGTGTGGTGATTTACAAAAAATTCCAAATTCCCTGCCCTCTGCCTGCTC 
T G GAT CACAACAAC GGAAATCT GAC GTTCAACAACACACTT C CGATT CACATGAT CT C GCT GCCT AAT GACTCGGAGAGC 
TCGGGTGTGAACTTCATGATGGATTACGCTCACCACAACCCAGCTGGGCTGGATGAGAAGCAGGTCGCAGGCCCTCTTC^ 
CAGCAATGGCGTGGAGTACGAAGCCCAGGGTGCTGACAAATGCCAACGAAAATACTTTGTGT^ 

ATGCAATCCCAATCCTGGCTTTTGCTTTTGTCTGCCACCCTGAGGTCCTTCCCATCTACAGCGAGCTTAAAGATCGATCC * 
CGCAGAAAGATGCAGACGGTGTCCAACATTTCCATCTCAGGCATGCTCGTCATG 

TCTGAGCTTCTACGGGGACGTTGAAGACGAGCTGCTGCATGCTTACAGCAAGGTCTACACATTTGATACGGCTCTTCTCA " 

TGGTGCGCCTGGCAGTCCTGGTGGCAGTGACACTGACCGTGCCCATCGTGCTGTTCCCGATCCGTACTTCGGTGATCACA 

CTGCTGTTTCCAAGGAAACCCTTCAGCTGGCTGAAGCATTTCG 

GGTCATCCTCGTGCCTACCATCAAATACATCTTTGGATTCATAGGGGCTTCTTCTGCCACTATGCTGATTTTCATTCTTC 
CGGCTGCGTTTTATCTCAAGCTCGTCAAGAAAGAACCTCTAAGATCACCCCAGAAGATTGGGGCTTTGGTCTTCCTTGTG 
ACT G GAATT ATTTT CAT GATGGGAAG CATGG CGCT CATTAT ACT C GACTGGAT CT ACAAC CC GC C GAAT C C CAATCACCA 
CT AAT C C CG GGGAGAC G C GT CT C CACT GGAAACAGCT GAAATT GT CT GAAGGAC ATTTT AGTTGT CTTGATT GGGATGTT 
AGT C T GAG GAATT AG CAAGATT C CAAAGACGTTTTT CTAG CT CT ATC AGCAC ACAT TTTAAC C CAG GC CGT GCAGTGCAG 
TGTGTGATGCCCGAGTTGTGTTTGCAGCAGCTGTGCAAGCTGAAGCCTGTTGGCTGCGTGTGTTGGTCAGCAGACAATAG 
CCTGTCCCCC CATG GT CACTC CACTTCT CT C CACC CC CAGATTAACAGGT AAT T CT ACT CT CAGAACATCAGACAAAGAC 
CTCCTGGTTGGGATACTTGTGGAAGAGAAAATTATGGGTTTTGTTGGGAATGGTTTTGTTGGGAATGGTGAAGGATGCAT 
T AAAAATTCT GT GC GAAGT AT CAT CAGTT AC GGC CAT CT CT CACT CTACACCAACACT AAG G GT CGGTTGACT AG CTGAG ; 
GCAGGGGGATAT CTT GGGCT GTCC C TGT GAG GTtT CAT GACGTAT GACGGTT GC CAGT AT AGAGT ACTTC ATTTCAATACT i 
CAAGGAATAGTTTGCCCAACCTGCTTATTACACCGAGTTAGTGAATTTTCATTCTTTTTATTTTTTGGTATGGCAAATGT \ 
TCAAACATCTCACTAAACATAGAGGGGGGTTATTTAGTTTATCATGAAACAACTACCAAAAAGTAATGGTTTTTAACATC 
TGCCTTTTCATGTTGTTTCTGATATCTTTGTTCCACTTTGTCTTTGAACAAGGCTTCCCCTCTCGTCTTCCGTTCTCACT 
CTT C GT AATTT T C AGAAAAAC CAC AGT C CGT GT GGGAGAC ACACTACCC C AGT ATTGTTT GAT AC AT CTCTATT T GATAA ' 

ACAT T C AGT G CAGGAAACTGTGAT TTTG C TATAT GTTTGT GTACATAATCT CATTCTG CAGT TAT CAGAACGTT GACATA 
TGGGACATTGGATTTTTATTTTTTACATATGTAGGTTTTTTTTCTTCACAGACAAAATGTTTATATCATCAGGGTGGGGC 
AGGGAATTAAGCTGGTGGGCTCAAAAATCCATGGGTACTTATCTGTCCATTGGAGACATCTAAAAATTAAAGTCAAAGTT 
GTGCATAGTTCAGTAATGCTCTTCACTGTTTACAAGACTATAATCATCTCAGCAAAGTAGCGAAGGAAGGGTTTGTAAAT 
AATTTTCAGTGCTAACAATGGTCTTTGAATTTAACATGTCTAGAATAGGGTTTAGTTCATTTTAGTTTACATCGGAACTT 
G G T G ACATT CATTT G C C ATT AAT AGAACAT C GT GTGGTT AG GAATAAATGCAC CAACAT GAC C C AAAACCAC ATT GCT CA 

CATCCATGGACGCTTACACGGGATGGAGTTCAGACTCCGCCCACTGCCTAGAATCTCTGGCATTGACTTCTTTTTTTCAA 
TTACTGGTTGTATATTTTACATTTCAGTAAAAAGCCAAGTCTGAAATGCAAGTATCCTAC^GGTAGTGATATACTGTCAC 
AC C AAAGCAGTTTTTCA\CAGT GTGCCTAAGAAACATATATTTGTTTACTGT GTCATTTTCT GGACCTTAAGACTATACA 
AT G GT GT G C TTTGT GATAGAAATT GAGCATT ACT AGAAACTACC ATAACT TC G GAGTTAT TAT CATTTG GGAGGGAAAAA 
AAT CACCAAATAGGATCTTTAAGATGTCTTT CT ATTTT CATTCATCCAAGAAATAAATCAAGATAGAAAGCAGAAAGGGT 
GT G CT GCAAAT ATTAAATTATTAT T TT ATCTGAACATATTAATGTGAATAT AAAAGATT AAT CCAAAAAT ATTGGAAACC 



TGTGAAATCTGTACGTATTTCGGATCTTTAfACTTAGAGCAGCTAACCTGGAAGTATCCACTTGTGTAGTTAGATATTCA 
AGAGCAAT AT CACAGT G C CAAAGAC TTCATT CACACAG CAC C GATC ATTT GT GTG G CC GTT TT CAG G ATTAGAGTT CACA 
C AGACT C CAG G GCAGATACTCAGAATAC CGTTTT CTGG T AAAT TT AACCCATTT GT AAACAG AT ACAAATTTT ATTTTCT 
TACATCATTTATAAGACGGTTCAATGTAC*GGGGGATTTTTTTTATTACAGACTGTGTATTGGTATATAATAAATAAAC 
>SC|aftpl|15065 Frame 2

MDPMEIJWVSIEPDGDSCSGDSIQDafTQffiNSDKDAMNSQFAMEDA 
^!?^f^ NGFLG KKKLAD YADEHHPdbs FGMSS FNLSNAIMGSGI LGLS YAMANTGI I L 

f£l^ S ELKDRSI ^^ SNIS ISG>tt\^LIAALFGYLSFYGDVEDFJ,LHAYSKW 
vrrvoil^I!!^^'^^' LTVP I VL F P I RTS VI TLLFPRKPFSWLKHFG I AAI 1 1 ALNK I L 

GSMAJLIILDWIYNPPNPNHH* 
>SC|F77|12685

AGCGACATGGCCCCGCCCGCGCTCCAGGCCCAGCCTCCAGGCGGCTCTCAACTGAGGTTCCTGCTGTTCCTGCTGCTGTT 
GCTGCTGCTGCTGTCATGGCCATCGCAGGGGGACGCCCTGGCAATGCCTGAACAGCGACCCTCCGGCCCTGAGTCCCAAC 
TCAACGCCGACGAGCTACGGGGTCGCTTCCAGGACCTGCTGAGCCG^ 

TCAGAACCAAGTCCTGACCCAGCTGTCCGGATACTCAGTCCAGAGGTGAGATTGGGGTCCCAC GGCCAGCTGCTACTCCG 
CGTCAACCGGGCGTCGCTGAGTCAGGGTCTCCCCGAAGCCTACCGCGTGCACCGAGCGCTGCTCCTGCTGACGCCGACGG 
CCCGCCCCTGGGACATCACTAGGCCCCTGAAGCGTGCGCTCAGCCTCCGGGGACCCCGTGCTCCCGCATTACGCCTGCGC 
CTGACGCCGCCTCCGGACCTGGCTATGCTGCCCTCTGGCGGCACGCAGCTGGAACTGCGCTTACGGGTAGCCGCCGGCAG 
GGGGCGCCGAAGCGCGCATGCGCACCCAAGAGACTCGTGCCCACTGGGTCCGGGGCGCTGCTGTCACTTGGAGACTGTGC 
AGGCAACTCTTGAAGACTTGGGCTGGAGCGACTGGGTGCTGTCCCCGCGCCAGCTGCAGCTGAGCATGTGCGTGGGCGAG 
TGTCCCCACCTGTATCGCTCCGCGAACACGCATGCGCAGATQW^C^CGCCTGCATGGCCTGCAGCCTGACAAGGTGCC 
TGCCCCGTGCTGTGTCCCCTCCAGCTACACCCCGGTGGTTCTTATGCACAGGACAGACAGTGGTGTGTCACTGCAGACTT 
ATGATGACCTGGTGGCCCGGGGCTGCCACTGCGCTTGAGCACCGGGCCCTGCTCCTCACCTACACTCCCCTTCAAGGATG 
CTATTT ATATTTGTATTTATT AATATTATTAATTTATTG GGGTCGGGCT G GGT GGAT GGATT GT GTATTTATTTAAAACT 
CTG CTAATAAAGGTGAG CTT GGTTT CAAAA 



SEQ.ID. No: 67 
>SC|F77|12685 Frame 1

MAPPALQAQPPGGSQLRFLLFLLLLLLLLSWPSQGDALAMPEQRPSGPESQLNADELR 
™™^ LLLTP ™ PWDITRPLK ^ S ^^ 

LRV^GRGRRSAHAHPRDSCPLGPGRCCHLETVQATLEDLWSDWVLSPRQLQLSMCVGE 
CPHLYRSANTHAQIKAilLHGLQPDKVPAPCCVPSSYTPVVLMHRTDSGVSLQTYDDLVAR 
SEQUENCE LISTING

<110> Lemischka, Ihor 
Moore, Kateri 

<120> GENES THAT REGULATE HEMATOPOIETIC BLOOD FORMING STEM
CELLS AND USES THEREOF 

<130> 2275-1-005PCT 

<140> UNASSIGNED
<141> 1999-08-20

<150> US 09/138,132
<151> 1998-08-21

<160> 96

<170> PatentIn Ver. 2.0

<210> 1 

<211> 400 

<212> DNA 

<213> Mus musculus 

<400> 1 

cccacgcgtc cggagagcgg cagccatctc catgccaagg ttgaacaaaa tgccaggaat 60
gttcttttct gctagcccaa aggattcgaa agaacacagc cattctcttc tagacnacaa 120
aaagcagaaa aaaaggccaa agacttttgg aatggacgtg aaaacatacc tgagatcgat 180
gatcccacat ctggaatctg ggatgaaatc ggccaagtcc aaagacatac tttctgctga 240
agaagttatg cagtggtctc agtctctgga aaaatccttg ccaaccagac aggtcnaaat 300
gtctttggaa gattctaaag tctgattcag tgaggaaaat attgaattct ggttggcttg 360
tgaggactat nanaaaacan aaactgatct tttgcatanc 400

<210> 2
<211> 148
<212> DNA
<213> Mus musculus

<400> 2
cggctgctag aagacgacag aaggggactc actcgttttg agaagaccat ganagcggca 60
gccntctcca tgccaaggtt gaacaaaatg ccaggaatgt tcttttctgc tancccaaag 120
gattcgaaan aacacagcca ttctcttc 148

<210> 3
<211> 400
<212> DNA
<213> Mus musculus

<400> 3
tacggctgcg agaagacnac agaaggggaa ncagctgcgg tggccgcggg agtctgacaa 60
tgcaaagtgc catgttcctg gctgtccngc acgactgcgt acccazggac aagagtgcan 120
gcaacggccc caaggtcnan gagaagcggg agaaaatgaa tcggacactc ttnaangatt 180
ggaanacccg tttgagctac ttctngcaga gnncctctgc tcccgggaag cccnnaactg 240
gcaataaaac naacagcata cttttatcna nccttctcct nangaagcgc anctctnggc 300
ataaactttt natnanctgc tggccngtat atatgggctg gctgcattca nggcgttttt 360
taaagtccca gttctntgaa gaaaacattg aaattctggt 400

<210> 4
<211> 400
<212> DNA
<213> Mus musculus

<400> 4
gcctgtggtg gccccggi ccctcccttc ctgctcatcc tggtgtgtac gc, ccggag 60
cacctgaacc agagaaacgc cattcgggca tcttggggtg ccatccgcga agcccggggt 120
ttcagagtgc agacgctctt cctcctggga aaacctagaa gacagcagct tgctgacctg 180
tcctcagagt cagcagcaca cagggatatc ttgcaggcct ccttccagga ttcctaccgc 240
aacctcaccc tcaagaccct cagtggactg aactgggtga acaaatactg tcctatggcc 300
cgctacatcc tcaagacgga tgatgacgtg tatgtcaacg tcccananct ggtgtcagag 360
ctgatacaaa gaagggggcc ttcggacaat ggcagaaggg 400

<210> 5
<211> 1081
<212> DNA
<213> Mus musculus

<400> 5

cccacgcgtc cgatctcctc cagggccacc aagcacctct gaagagccat gttccaagct 60 
gccggagccg cccaggccac cccctctcat gaagccaaag gcagcagtgg cagcagcacg 120 
gtacagcggt ctaagtcctt tagcttgcgg gctcaggtga aggagacctg tgcagcctgc 180 
cagaagactg tgtacccgat ggagcggctg gtggcagaca agctcatttt ccacaactct 240 
tgtttctgtt gcaaacactg ccacaccaaa ctcagcctgg gcagttatgc tgcaatgcac 300 
ggtgaatttt accgcagacc tcactttcag cagctgttta agagtaaagg caactacgat 360 
gaagggtttg gtcgtaaaca gcacaaggag ctctgggccc acaaggaggt ggactcaggc 420 
accaagacgg cctgagaccc ctttaacacc cattccctcc cagcacatgg cctcccgctg 480 
ggcagtggaa aggagattaa cccgggggcg cggggtggga gaggatgagg ctccctcaca 540 
caggtttcag gcataaggct ctgctccagg attccttact tttcccatgg gaggttggcg 600 
ttgggaacca gaattggaat tttcaccata ctgtgtcctt tagtccacct catctcaccc 660 
cacggctccc tgggaggccc acaagcccag cttccatact taggtgcttt tctccagcaa 720 
ggagtcagca tgccctcctc agggtcccaa gctccctcac tgccacctgg gccttgtgta 780 
cccccttgtc tccccatcta cctctgcccc ttagcctggt aatgagccac agagactgga 840 
agagggagag tgccatntac tgggcctcat agatgccacc tcgctgaggg gggagggctg 900 
gggaagaggc aagacagcct gcagccttca gggtctgggg gtcccttgca ccacaaagct 960 
aaagctcttg ctagagcctc agctgacagg gtcggcagta gctatgctcc tccatctgtt 1020 
gtgctgttct gttgtgatca accctctttt aaaaacattt aaacagcaaa aaaaaaaaaa 1080 
a 1081 

<210> 6 
<211> 743 
<212> DNA 

<213> Mus musculus 



<400> 6 

attggtacgc ctgcacgnac cggtccggaa ttcccgggtc gaccacgcgt ccgcccacgc 60 

gtccgatctc ctccaggggg caccaagcac ctctgaagaa catgttccaa gctgccggan 120 

gcgcccangc caccccctct cntgaagccc aangcagcan tggcagcagc acggtncagc 180 

ggtctaagtc ctttaacttg cgggctcagg tgaanganac ctgtgcagcc tgccngaaaa 240 

ctgtgtnccc natggancgg ctggtggcag acaanctcnt tttccacaac tcttgtttct 300 

gttgcaaaca ctgccacacc aaactcancc tgggcagtta tgctgcnatg caccgtgaat 360 

tttactgcag acctcacttt cagcagctgt ttnagaatna aggcnactac natgaaaggt 420 

ttggtcgtta acagcacaan ganctctggg cccncnagga agtngactca ggcccnanan 480 

aggctganaa ccctttaaca cccattccct cccnncacat tgnctcccnc tnggcagttg 540 

gaaaagaaaa taancccngg gcnccgggtt ggganaagaa aaaggtcccc cccnccggtt 600 

ctccggcnta agggtctccc ccnnaatccc ttcttttncc cctgggaagt ttgggtttgg 660 

gaacccaaat tggaatttcc ccctnccgtn ttcctttntt ccccnccnct cccccccnng 720 
gtccctggga aggccccaan ccn 743 



<210> 7 

<211> 743 

<212> DNA 

<213> Mus musculus 



<400> 7 

gggggagtct tgagtcgcat cgcccgttnc 

tttttttttt gtcttgctgt ttaaatgttt 

cacnacngat ggaggagcat anctnctgcc 

"ttagctttg tggtgcaagg gaccccnnac 



gtaagcttgg atcctctana acgggcgccc 60 

ttaaaaagaa ggttgatcac cacagaacng 120 

gaccctgtca gctgangctc tagcaanaac 180

cctgaaagct gcaggctgtc ttgcctcttc 240 
300
360 
420 
480 
540 
600 
660 
720 
743 

<210> 8 
<211> 174 
<212> DNA 

<213> Mus musculus 



cccanccctc 
cntccnttct 
ggtncacnat 
ttgctggana 
ggtgagatna 
nacctcccat 
tttnaaggga 
ggtttttttn 
cctnntttgg 



CCCCCtCc 

czgtggctca 
gcccangtgg 
aaaagcacct 
ag^ggactat 
gggaaaaant 
ncctcatcct 
aaagggggtt 
gccccnaaaa 



gaagtggcat 
ttaccangct 
cantnaagga 
aanttggaac 
aggaccntat 
taggaatcct 
cttcccancc 
ccccnnggcc 
ccn 



ctatgaagcc 
aangggcnga 
cttgggacct 
tgggcttgtg 
ggtgaaaatt 
gcaaccaaac 
ccgngccccc 
ccccnntggt 



cagtanatgg 
ngttnatngg 
gaagaaggca 
ggcctcccnc 
ccaattctgg 
ccttatgcct 
nggggtttaa 
tnccctnaat 



ca ctccct 
ganacaaagg 
tgctgactcc 
ggaaccntgg 
ttcccaaccc 
gaaaacctgt 
agggaaaaag 
ttcccccccc 



<400> 8 

ggattccggt taggaatgaa attagaagct gtagatctca tggagccacg gttaatatgt 60 
gtagccacag ttactcgaat tattcaccat ctcttgagga tacattttga tggttgggaa 120 
gaagagtatg accagtgggt agactgtgag tcccctgacc tctatcctgt aggg 174

<210> 9 
<211> 711 
<212> DNA 

<213> Mus musculus 



<400> 9 

ggggctcagg 

tttttcccaa 

ccaaatggaa 

cgcgcgccct 

aagaaaaagc 

gtagcagctc 

caatcatcaa 

cctgtgagat 

tctgcagtgt 

ccngacttcn 

cnttagttgg 

acnaaancng 



gaaggccgat 
cacaacttct 
gtggctgaaa 
caattacctc 
agactggttc 
cagctctgag 
gaaraatgga 
gtgtgggatg 
tzcctgttca 
ncrtacgggt 
taaattnnct 
gcnattctgc 



cttccgggtg 
cggaggcaac 
gaaacttctc 
atcgacctgt 
gggatgtttg 
gagagtgaag 
caagtctaca 
gtcggtgtgc 
agaagttact 
tngcctccca 
gcctatgccc 
catctctgtg 



gagggggaag 
atattggaag 
gccctgctga 
ggtcttgaca 
acggctatga 
aagaagttgc 
catacccaga 
gagatgcttt 
cgtcaaactc 
cnaagaaagc 
ntntccanct 
gaaagggtcc 



cggcgtgact 
ggactcgggg 
ttctgagccc 
gaacattctt 
tagctgcagt 
tcctttacct 
tggtaaatct 
ttactctaaa 
taagaaggca 
caaantcctt 
accttgcnna 
ctgggggtta 



ggagtggaaa 
aggccggagt 
cgcgtcgtgc 
cacaatccaa 
gaggacacaa 
tccaatctcc 
ggcatggcta 
acgaaacgtt 
agcattctgg 
ccnaaacnac 
atccnnccag 



<210> 10 
<211> 356 
<212> DNA 

<213> Mus musculus 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

711 



<400> 10 

cacggccgcg anaagacgac agaagggtnc gggctgcgan aagacgacag aaagggggcc 60 

tttctccgct gcccccggcg cgcccggcag ctcctccccg gccatggcgt tcaccttcgc 120 

ggccttctgc ta^atgctgg cgctgctgct caccgccgcg ctcatcttct tcgccatctg 180 

gcacatcata ncr.-ttgatg agctgaagac cgactacaag aaaccctata gaccagtgca 240 

ataccctgaa ccrcttgtcc ttccagaagt actcatccac gcgttcttct gtgtcatgtt 300 

cctctgtgcg cagaattggc tgaccctggg cctcaatatg ccccttttgg catacc 356 

<210> 11 
<211> 400 
<212> DNA 

<213> Mus musculus 



<400> 11 

tacggctgcg agaagacgac agaaggggag 
cgagatggca tcrcgtagcc ggagacccga 
ccagaatgaa gcccggaaat acgttcgcaa 
gactgagcga gcgctggagc tcctctgttt 
cattggctgc gcrtctgggc tgagtggaaa 
gggcattgac atcagccctg ccatgttgga 



tgctcgccgg tcccagcgtt gccataccat 60 
acacagcgga ccgccggagc tgttttatga 120 
ctcacggatg attgacatcc agaccaanat 180 
accaganggt cagccttctt acctgttaga 240 
ttatatctca gaagaaggac actactgggt 300 
tgccgccttg gacgagatac anaaggggac 360 
-gccgctang ggactgg;.



aggctccrtt caaaccggcc 



400 



<210> 12 
<211> 400 
<212> DNA 

<213> Mus musculus 



<400> 12 

acggctgcga 

gcganaagac 

aaagctatgc 

ctgacccggg 

tcacggagga 

gtgggacaga 

aagatggggc 



gaagacgaca 
gacagaaggg 
tggggangca 
gtttccggcc 
acagatcaac 
tcaacagtat 
tgcngcccan 



gaagggtacg 
tacggccgcg 
gcgcgganct 
gganagcagt 
ctgacgagag 
gtctccaacg 
gatgggcggc 



gctgcgagaa 
agaagacgac 
tgattcacct 
canatatgaa 
gaccttcggg 
acagtggcat 
tccaggaagg 



gacgacagaa 
agaaggggga 
tcacctgctc 
cggacgggtg 
gctgggcttc 
ctacntcagc 



gggtacggct 60 
ggaaactgta 120 
tggccacccg 180 
gattatttag 240 
aacatcgtcn 300 
cgtatcaaan 360 
400 



<210> 13 
<211> 1360 
<212> DNA 

<213> Mus musculus 



<400> 13 

agctctagtc 

atttggagcc 

acagaatctt 

cctctggtgg 

gaactattgg 

ccaacttgtg 

atcgctttaa 

agtatgggac 

tccacgatgc 

tagcccactt 

agcctccgct 

acctcaacag 

ccgctcacta 

tccacatgga 

ccatcctaga 

azaaagccat 

cgccaacgct 

aggactatgt 

ccttggatat 

cctatgtggc 

gcgctcagtc 

aaagacgccc 

taaacaaact 



cccagagatg 
tgaggaggcc 
aaacccactg 
caacccctcc 
tttgggaacg 
ggtLccgtcc 
tcccaaggcc 
cgggcggctg 
ttttgtgaca 
tgatgggatc 
ggatgcgatg 
ggattctgaa 
cczacctccc 
gagtgtgaag 
cacaggcaca 
tgggggatat 
tccccctgtc 
ca-caagatt 
ccccaagcct 
tcr ctttgac 
tcgttcaaca 
tggrtagggt 
aaaaaaaaaa 



tcgccactac 
aaactgatcc 
aatggatggg 
tttgtgcctc 
cctcctcaga 
acgagatgtc 
tccagctcct 
agcggaatcc 
tttggagagg 
ctgggcctcg 
gtggagcaag 
gggtctgatg 
ctcaccttca 
gtcggcacag 
tccctcatca 
cccttcctga 
tccttccacc 
cttcagagcg 
gcgggacccc 
cgtggggaca 
gaccgggcag 
acaagctcac 
aaaaaaaaaa 



tgctgctgct 
gtgtccctct 
aacagctggc 
tctccaagtt 
atttcaccgt 
atttcttcag 
tcaggcccaa 
tgagccagga 
ctctgtggga 
gcttccccac 
ggctgctgga 
ggggagagct 
taccagtcac 
ggctgagcct 
caggacctag 
atgggcagta 
ttggtggagt 
atgttggcct 
tctggatcct 
aaaacgtcgg 
aaagaaggac 
cgggccacag 
aaaaaaaaaa 



gctgtgcctg 
tcaacgaatc 
agagctttct 
catgaacacc 
tgtctttgac 
tttggcatgc 
tgggaccaag 
caatctgact 
gcccagcctg 
tctggctgtg 
gaaacccgtc 
ggtcctaggg 
catccctgcc 
ctgtgcccag 
tgaggagatc 
cttcattcag 
ctggtttaac 
ctgcctgttg 
tggggacgtc 
cccgcgcgtg 
tacgcaggcg 
cagctatgct 



ctgctgggga 
caccttggac 
aggacctcca 
cagtattttg 
acgg'gttctt 
tggtttcacc 
tttgccattc 
atcgggggga 
atctttgctt 
ggcggagttc 
ttctcctttt 
ggctcagacc 
tactggcagg 
ggctgcagtg 
cgggccttga 
tgttccaaga 
ctcacaggcc 
ggcttccaag 
tttttggggc 
ggactggcgc 
cagttcttca 
tctttccaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1360 



<210> 14 

<211> 1712 

<212> DNA 

<213> Mus musculus 



<400> 14 

tcttcgaaag 

agtcatggct 

gaatgtactg 

ggtccccagc 

ttatggctgt 

tagtgggagt 

caccggaata 

catggaaaaa 

gttggcagag 

caaccttgtt 

cggggagctc 

gcacaaagtt 



ccgggctgag 
gc--cccgag 
aagacatctg 
tacatccggg 
ggtctgactg 
gccagggatt 
gacatgactg 
ttzggtttcc 
gc -gggatcc 
cc-gataaac 
ta"tcagtg 
tza-igggggg 



gggaatcctg 
acgctgatga 
cagacctcca 
aaagtctgca 
ttcctgagcg 
gctatgtgct 
aggtccaggt 
aggcacccaa 
agagtgagag 
aacaagtcct 
acgtctatgc 
aatgcctggg 



gacaggggaa 
gatccacaag 
gactaatgct 
gaatgtacac 
gctggaaaac 
tagccagctg 
cgaagtggct 
tgtgactttt 
ctatggtatt 
ccaggaggtc 
cagccttgaa 
aggcgctctg 
tcctggacgt
gacgttcaga 
tgtgtcacgc 
gaagacgtta 
tgccgaattt 
gttggtgaga 
aaaacctatc 
ctccacggcc 
gtcatatcca 
tatcgagtgc 
gtgccagaag 
tactggaagg 



ggagatcgtg 60 
actactatgg 120 
gagccaagcc 180 
gttcgaggta 240 
tggatctggg 300 
agggacatgt 360 
ttgaacacca 420 
gcatcgagaa 480
actgtgttat 540 
cgaagcacgg 600 
acatcaagtc 660 
atcttgccat 720 
cattgcccaa aagattgt tctgccctcc acgtttggtc actgccgata tc ^actgt 780

tgaaaacaag gagctcgaag gggttcttgg tgactgtcgc tttgtgtctg ccacacttcg 840 

cctcttcaaa ctccctaaga cagagccagc cgaaagatgc cgagttgttt acaatggagg 900 

aatcaaggga catgaaaagg aactaatttt ccatgcaaat ttcacattca aggaaggcga 960 

agctgttgca gtggatgagg agacggcagc rgtcctgaag aactcacgtt ttgctccgga 1020 

ttttctcttc acacctgttg acgcctcgct gccagctccc caggggccgt tctgagttag 1080 

agacaaaggt tctaatcaga gatccattca agcttgcaga ggactctgac aagatgaagc 1140 

ccagacatgc acccgaaggc acgggaggct gctgtggcaa gaggaaaaac tgctagatct 1200 

acagccagcg cggagcccac cgggctcaag agggtggcta aaggacagtc acagaggctt 1260 

cttagcctgc tcttcgccag tgcacagatt atgtgaaggt ggcaaagcca ccacaagcta 1320 

gaccactgct aagaataaga gtgactttta gaggatgtta attgaaggtt cacagcaaat 1380 

cgcctgcttt tctatttctc tatctcagag ttctggtgcc acctagtggt cagaagtaga 1440 

acttggaagc ccaaggttta ctcaaagggc caaaggcatc atcaacgttg tgagaattat 1500 

cttccttctg gcctaccaca ggacacctct gggttcttct ctgtggttac caggaagcac 1560 

agtacttact aaatttatgc taaccatgac aaaagattgt caactcaaat ttgctaggag 1620 

tattctttag gttgctgtct gcaatttttt tctctgtaac tgaatgaaaa agaaaacaaa 1680 
taaaaaataa atttgacttc gaaaaaaaaa aa 1712 



<210> 15 

<211> 1539 

<212> DNA 

<213> Mus musculus 



<400> 15 

cggacgcgtg 

cgtaaagttt 

atcaggggta 

cgacaggtcc 

tgtatctgtc 

taccaccact 

catctgtcag 

gaatccaaaa 

agtgaccgaa 

aggaccctac 

ccaggcctgt 

gttctcgttt 

cccgcgtgcc 

tcttccagag 

cctagagccc 

aaagctgcct 

aagaaggacc 

gcacggcgga 

atgggggtcc 

atctaccata 

gcccagtacc 

ggagcgcagg 

ggccccgccc 

tgatcaccca 

aagccaggtg 

attgaacagt 



ggggagtctg 
ggcaagaaat 
gctcggctag 
acagggccca 
ctgcctgcgg 
gagcgaagcc 
ctggccttcc 
aggggctttg 
tttccggtga 
ctcctggtgc 
tttagctggc 
gaggctggcc 
ccagacatat 
ctggccatgt 
cctggagagc 
ctaactgatc 
ggcatccggt 
gcatttctat 
tgaagcccaa 
acactgagga 
ggaggctgct 
caggcatcaa 
cctgtgaccg 
agaccaggaa 
gcccggggag 
aataaagctt 



tggagccccc 
gctggcgcaa 
aaagctggga 
gccgtcgagg 
atggcgagag 
acctgttggc 
cgggtaccgg 
ttcccatgga 
tcgtgcagaa 
tgggccaaga 
cctaccgttt 
gccgctgtga 
gtggggttgt 
ccccaccctg 
ttcgggaggt 
ccgggcctca 
ctctatgcgt 
gagaacgtgt 
gagggccccc 
tctgagttgg 
ggaactggag 
ggccaagctg 
gccctgaagg 
caacttcgaa 
gagccacact 
gcctatcaac 



ggccaaagac 
agtgtgggct 
cgtgcgtgat 
ggaacgccgg 
ctgtcccagg 
tgcacagcac 
agaatgttcg 
agaaaactct 
gacagaagcc 
tgacatccaa 
cctgcgcaag 
cccaggtgag 
ggctgccgcc 
ccccctgcct 
ggccccagga 
aagcctacca 
ccgtgtgcaa 
gcatgctgga 
ttggtggccg 
ccgggctcgg 
ctggatgagg 
gtgaccctgc 
ccTigagcggc 
cataacccgt 
ctigccctacc 
ttcaaaaaa 



ggcatcctct 
ctgctgtatg 
ggtggcctgg 
gtcatacgct 
gacactggtg 
cgccagtcct 
tcaggatcag 
atctactcct 
acctcccgct 
ctgagggaga 
tacggctctg 
ggcctttttg 
attgcccgcc 
cgggccctct 
tttgagctgc 
ttgctgctca 
gcagaccagc 
ggccagcctt 
cagccccctt 
cccaggacag 
ccggaagcgc 
tgacccgtga 
cagccactgc 
ctactctgac 
tcctccctca 



accagcagca 
cgggaggccc 
gaccagcagg 
tggctgactg 
ccttcctgat 
gggtggaccc 
gacaggctga 
cctggcagga 
gccagctgaa 
catccaagcc 
acaagggtgt 
cctt cagtag 
agcgggagcg 
ccctgccctc 
ccaczcccaq 
gccccaacac 
aagcacacag 
gggcrgacca 
gggcagcctt 
caatctggaa 
cggccgctct 
acggaagaag 
aggacagagg 
ctgcagggac 
gactgtacag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1539 



<210> 16 

<211> 3599 

<212> DNA 

<213> Mus abbotti 



<400> 16 

agagacagcg tgatcccggc ctcccacggg 
caaagtccat ggagtctgaa gactctgtca 
gggcacttgg caccggaagc tagccgggta 
ctgcctagac aggctcagac taggtgctgg 
caaaggctgt taacagaagt gcctgcctag 
gcgagacgga ggtaggagta taatggctgt 
actgagtctg ttttctctgg tgcaaggtgc 



gcagctttta ctgtctaggg aagaaatccc 60 

agcctcgcta ggaaacctag gagt*ttaga 120 

ggcggagcct cacccggatt gagttcacag 180 

gcacctggga ggaggaggag acacragcag 240 

gcctggaggc aagacgctgc tgttcacagt 300 

ccaggtgctg cgccagatgg tctacrtcct 360 

acacagtggc agcccccgag aagacrt ccg 420 
cttctgtggc cagcggat agacccaaca gagcaccctc cactatgatc aa ;tcaga 480
gcctcacatc tttgtgtgga acacagagga gaccctcaca attcgtgccc ccttcctggc 540 
agccccagat attccccgct tcttcccaga gcctagaggg ctctatcact tctgcctcta 600 
ctggagtcgc cacactggga gactccactt gcgctatggc aagcatgact acctgcttag 660 
tagccaagcc tccagactcc tctgcttcca gaaacaggag cagagcctga agcagggagc 720 
cccgctgatc gccacctctg tcagctcctg gcagattccc cagaacacca gcctgcctgg 780 
ggctccgagc ttcatcttct ccttccacaa tgccccacac aaggtctccc acaatgcatc 840
tgtggacatg tgtgatctca agaaggaatt gcagcagctt agcaggtacc tgcagcaccc 900 
tcaaaaggct gccaagcggc ccaccgcagc gttcatcagc cagcagttac agagcctgga 960 
gtcaaagctg acccctgtga gcttcctggg agacacatta tcctttgagg aggaccgggt 1020 
caatgctaca gtgtggaagc tgccacccac agccggtcta gaggatctgc atatccactc 1080 
ccagaaggag gaggagcaga gtgaggtcca ggcatactcg ctgttgcttc cccgggccgt 1140 
attccagcag accagaggcc gtcgccggga tgacgccaag aggctcctgg tagtagactt 1200 
cagcagccaa gctttgttcc aggacaagaa ttctagccaa gtcctgggtg agaaggtctt 1260 
gggtattgtc gtgcagaaca ccaaagtcac caacctctca gatccggtgg tactcacctt 1320 
ccagcaccag cctcagccaa aaaatgtgac tctgcagtgc gtgttctggg ttgaagaccc 1380 
ggcatcaagc aceacaggga gctggagcag tgcaggctgc gagacagtga gcagagacac 1440
acagacatcc tgcctgtgca accacctgac ctactttgca gtgctgatgg tgtcatccac 1500 
agaggtagaa gccactcaca aacactacct cacgctcctg tcccacgtgg gctgtgtcat 1560 
ctctgctctg gcrtgtgtct tcactatcgc tgcctacctc tgctccagga ggaagncacg 1620 
tgactacacc atcaaagtcc acatgaacct gctgtccgct gtcttcctgc tggacgtgag 1680 
cttcctgctc agcgagcctg tggcactgac gggctccgaa gcagcctgtc gcaccagtgc 1740 
catgttcctg cacttctccc tgcttgcctg cctctcctgg atgggcctcg agggctacaa 1800 
tctctaccga ctggtggtgg aggtcttcgg tacctatgtg cccggctatc tgctcaagct 1860 
gagcatcgtg ggctggggtt ttcctgtctt cctggtcact ctggtggcgt tggtggatgt 1920 
gaataactac ggccccatta tcctagctgt gcgccggact ccggaacgtg tcacctaccc 1980 
ctctatgtgc tggatccggg actccctggt gagctatgtc accaacctgg gcctcttcag 2040 
tctggtgttc ctgttcaacc tggctatgct ggccaccatg gtggtgcaga tcctgcggct 2100 
tcgcccgcac agccagaact ggccccacgt gctgaccctg ctgggcctca gcctggtcct 2160 
tggcctcccc tgggccttgg tcttcttttc ctttgcttcc ggcaccttcc agcttgtcat 2220 
cctctacctc ttcagcatca taacttccta ccaaggcttc ctcatcttcc tgtggtactg 2280 
gtccatgcgg ttccaggccc aaggcggccc ctcccctctg aagaacaact cagacagcgc 2340
caaactcccc accagctccg gcagcacctc ctccagccgc atctaagcca ccgccacacc 2400 
tcccctccgg gaggacacat gcatggcgtc cgctcacgat gtctgtggcc cagtgctgtg 2460 
cccacccagc ct'tgttggt tagtggcata ctagagaagg ccctggtcct tgaaggcgta 2520 
gggctgttgc tctggtaggt agatacctag cttgccttgg ggacgactct ggtcctcaaa 2580 
gggctcagaa gcacactgcc attctgtttg tggggccgtt tcagtctgga gctaaggcct 2640 
tgtctttctg gccacctctg ggtccagctg ttgctgctgg gtgttgagac ctgcagaccc 2700 
aagctggggt tagatctcga aggaggctga cacatccggc ctgagacaca gctaactgtc 2760 
ttgacttgct gctctgtctc tgtggtcacc atgcagatcc cgagggtggc actgggggta 2820 
aatgttctgg gagaaggttt ggaggcagag caccttagga gctgagcatc tcccccagcc 2880 
tttctgcaaa ccctcctctt cattccccat ccccaacccc tcctctgtgt tcccctaacc 2940 
ctccacctga agcctggggt cctagaccaa tgctgtgatt tggggtggta gttcccagca 3000 
gtttcctggt gccagctatc aacttctgtc tgttgtgtgg gctttggcct ctgactcagg 3060 
gcaggtttct gccrgagccc tctctccaag ctgcctcacc tttgctcgca cctcagaggg 3120 
acctccatct ctcctgaagc ctcctccctc tggcaagtac tgggatacag ccaccctttc 3180 
aacccagcac tctgaagacc aagacagccc cctctggtga cactggccaa gcttgatctt 3240 
tttcctaaga agrggtcttc agatccccgc aggtcgctca gaagacactg ggctgcctag 3300 
tgtgaattct gtcctactaa cgtacagtga gcagctcctc acccccaccc ccgcaaaagc 3360 
tctcaccaag tccizggagtg tcaggcaggg ggctggaaat ccaggaggac ttcctgcaaa 3420 
aggcagcatt tcatcttgac ctcagccttc aggttgggga gaatgttctt tttaaatacc 3480 
agttcatttg tcttttgata ttaaagctct ttatagagag tctggaaact gtaggcgatt 3540 
gtcgagaaga gaaataaaaa tgagctgtta tctaatgcca tggcaaagca gcacaaaaa 3599 

<210> 17 
<211> 399 
<212> DNA 

<213> Mus musculus 
<400> 17 

agcgacatgg ccccgcccgc gctccaggcc cagcctccag gcggctctca actgaggttc 60 
c^gctgttcc tgctgctgtt gctgctgctg ctgtcatggc catcgcaggg ggacgccctg 120 
gcaatgcctg aa=agcgacc ctccggccct gagtcccaac tcaacgccga cgagctacgg 180 
ggtcgcttcc aggacctgct gagccggctg catgccannc atagcgagag gactcraact 240
ctgaaccaag ccczgac. ctgtcnggat actcantcca cangtgagat tc jtccac 300
ggccagctgc tacrccrtcgt caaccgggcg tcgctgantc anggtctccc cgaacctacc 360 
ncgtgcancn agcgctgctc ctgctgacnn cnaangccg 399 



<210> 18 
<211> 400 
<212> DNA 

<213> Mus musculus 



<400> 18 

cggacgcgtg gggaaagcgg gtggggatat acgtgtgctg gtgccctaca attcgacagg 60 

cgtcttggga ggg^cgacca ccttgcactg tactctgact tctaatgaga atgtgactat 120

cactcaaata accrggatga agaangattc aggtggaccc cacgctcttg tggctgtctt 180 

ccaccccaag aaggggccca acatcaaaga gccagagagg gtgaaattct tggctgccca 240 

acaggatctg aggaacgcat ctctggccat ctcgaactta agtgtnnaag acgaangcat 300 

ctatgaatgt cagattgcca cattccccag aagcagtana agcaccaatg cctggctgaa 360 

ngtgcaagcc cgacetnana acactgcana agccccggac 400



<210> 19 

<211> 2313 

<212> DNA 

<213> Mus musculus 



<400> 19 

cgaattcctg gg^cgaccca cgcgcccgag 
gaggtcaaat ggaccccatg gaactgaaca 
gcagcgggga cagtattcag gacagctaca 
tgaacagcca at-rgctaat gaagatgccg 
tagggaagaa gaagctagcc gattacgcgg 
gaatgtcctc at"aacctg agcaacgcca 
atgccatggc caacaccggg atcatccttt 
tctcgctcta cteggttcac cttttgctga 
atgaaaaatt gggcgagaaa gcatttggat 
ttacaatgca gaacattgga gccatgtcaa 
ctgaagtaat cagagcattc atgggacttg 
gcaactacct cgtcttattt gtgtccgtgg 
atttaggcta ccrrggctac accagtggat 
gtgtggtgat ttscaaaaaa ttccaaattc 
acggaaatct gacgttcaac aacacacttc 
cggagagctc ggg^gtgaac ttcatgatgg 
atgagaagca ggrcgcaggc cctcttcaca 
ctgagaaatg ccaaccaaaa tactttgtgt 
tcctggcttt tgcttttgtc tgccaccctg 
atcgatcccg cagaaagatg cagacggtgt 
tgtaccttct tgcggccctc tttggttatc 
tgctgcatgc ttacagcaag gtctacacat 
cagtcctggt ggcagtgaca ctgaccgtgc 
tgatcacact gczgtttcca aggaaaccct 
caatcatcat cgcactcaac aacatcctgg 
ttggattcat aggggcttct tctgccacta 
atctcaagct cg-caagaaa gaacctctaa 
tccttgtgac tcgaattatt ttcatgatgg 
tctacaaccc gccgaatccc aatcaccact 
cagctgaaat tg-ctgaagg acattttagt 
tagcaagatt ccsaagacgt ttttctagct 
cagtgcagtg tg^gatgccc gagttgtgtt 
gctgcgtgtg ttggtcagca gacaatagcc 
acccccagat taacaggtaa ttctactctc 
atacttgtgg aagagaaaat tatgggtttt 
ggatgcatta aaaattctgt gcgaagtatc 
aacactaagg gtcggttgac tagctgaggc 
atcatgacgt acgacggttg ccagtataga 
tgcccaacct gcrtattaca ccgagttagt 



agggtagaca gaaaggcggg aaagggctgt 60 
acgtcagcat cgaacccgac ggagacagct 120 
ccggcatgga aaactccgac aaggacgcca 180 
aaagtcagaa gttcctgaca aatgggtttt 240 
atgagcatca ccctggaatg acttcctttg 300 
tcatgggcag tgggatctta ggcttgtcct 360 
ttataatcat gctgcttact gtggcaatac 420 
agacagccaa ggaaggaggg tctctaatct 480
ggcctgggaa aattggagcc ttcatctcta 540 
gctacctctt catcattaag tacgaactgc 600 
aagaaaacac tggggaatgg tacctcaacg 660 
ggatcatcct cccgctctct ctccttaaaa 720 
tttctctctc ctgcatggtg tttttcgtca 780 
cctgccctct gcctgctctg gatcacaaca 840 
cgattcacat gatctcgctg cctaatgact 900 
attacgctca ccacaaccca gctgggctgg 960 
gcaatggcgt ggagtacgaa gcccagggtg 1020 
tcaattcccg gacggcctat gcaatcccaa 1080 
aggtccttcc catctacagc gagcttaaag 1140 
ccaacatttc catctcaggc atgctcgtca 1200 
tgagcttcta cggggacgtt gaagacgagc 1260 
ttgatacggc tcttctcatg gtgcgcctgg 1320 
ccatcgtgct gttcccgatc cgtacttcgg 1380 
tcagctggct gaagcatttc gggatcgctg 1440
tcatcctcgt gcctaccatc aaatacatct 1500 
tgctgatttt cattcttccg gctgcgtttt 1560 
gatcacccca gaagattggg gctttggtct 1620 
gaagcatggc gctcattata ctcgactgga 1680 
aatcccgggg agacgcgtct ccactggaaa 1740 
tgtcttgatt gggatgttag tctgaggaat 1800 
ctatcagcac acattttaac ccaggccgtg 1860 
tgcagcagct gtgcaagctg aagcctgttg 1920 
tgtcccccca tggtcactcc acttctctcc 1980 
agaacatcag acaaagacct cctggttggg 2040 
gttgggaatg gttttgttgg gaatggtgaa 2100 
atcagttacg gccatctctc actctacacc 2160 
agggggatat cttgggctgt ccctgtgagg 2220 
gtacttcatt tcaatactca aggaatagtt 2280 
gaa 2313 
<210> 20
<211> 3408 
<212> DNA 

<213> Mus musculus 
<400> 20 

gcagctaccg gcccggaatt cccgggtcga cccacgcgrc ccacggctgc gagaagatca 60 
cagaaggggg cggtgacgca ggaccaggac tcgcgcgtcc agcggagaag caggagaagc 120 
cggcgacctt gcgctctcag cctgatccct gtcttggcgg cctgaacatt cgcagctgga 180 
gagatggcgt tcgtgaagag tggatggtta ctrcggcaga gcaccattct gaaacgctgg 240 
aagaagaatt ggttcgacct gtggtcagac ggtcacctga tctactacga tgatcagact 300 
cggcagagca tagaggataa ggtccacatg cccgtggact gcatcaatat ccgcacgggg 360 
catgagtgcc gggacatcca gcctccagat gggaagccca gagactgtct gctgcagatc 420 
gtttgccgag acgggaagac catcagtctc tgtgcagaga gcacagacga ttgcctggca 480
tggaagttta cactgcagga ttccagaaca aacacagctt acgttggttc agcaatcctg 540 
tctgaagaga ctgcagtggc cgcgtccccg cccccctacg caacctatgc tacaccnacc 600 
cctgaggtct acggctatgg tccatacagc ggcgcatacc ccgcaggaac tcaagttgtc 660 
tatgccgcca acgggcaggc atatgcagtg ccataccagt acccgtatgc aggagtttat 720 
ggacaacagc ctgccaacca agtcatcatc cgcgagcggt accgagacaa tgacagtgac 780 
ctggctctgg gcatgcccgc cggggcagcc accggcatgg ccctgggctc tctgttctgg 840 
gtcttctaga gccttcaaca ttttctgtgc atagcttctg ttagtcctgt gtgcagtaat 900 
ttgatttgca gggcatttct gtttgtgaca agtgtctttc ataataattt aaatagttct 960 
tttgaaggtg gtaatctaat aattgtgact gacctgcatg gtaccacaaa gaaagcccga 1020 
ggtatgctgt gagtgagagc ctgagtcctt ccgggtacta gcttgcacca agtctttctt 1080 
agggactttt ggatggcttt atgtaaacac acccagttaa atgggcaatt tccgtccagt 1140 
taggtgcagt gttgaattaa gggatggctt tccttgctat gccaatacta atactgctga 1200 
tggaggaaga tgtgtgcaag tgtggtgagg agagtcacag cttctttaac tgtggattct 1260 
cttctagacc cctgctgcgt gttaccctag gagctgtggg ctggtggctc ctgcaagact 1320 
atggtgtgag gaccctgtaa cgtacctctt ggagcactta ggtacccctt gaagctccta 1380 
ggtatcacca gcaggattgg ctgctcagga tgcagagggc caccccctcc ctttaaaaat 1440
tacgctccag taatctgccc agttttattt tcttgttatt cttctgtttg cttttcctgg 1500 
ggatgattgg cattagtctg gagttaggaa ttgattcgag tgccggtggg tggaggcatg 1560 
cagggagctg tccagcgacc tgctctcagt gtttgtttta ggtatattga ttgccagctc 1620 
aggctgcaga gagcctatag agactatttt tctacttgta aagaaagtat ggtgagggga 1680 
attgaggaga gccrtgtttg aatgttcctg cctcaggcct cctggggccc acactgcgtg 1740 
gtcctgggga ggcttttcct cagcagtgga agggaggccc cgtggctgtc cagagtctca 1800 
ggttttgagt gagaaatggg attgggtaga gcatctcagg gatttgttct aatccctcat 1860 
gttatgggga tccagccgtg ttctcagtcc agacccgcrc acctcagaag agcttaaaac 1920 
atttctggtc cccaaatgtg tggcactctg agaagctcac aatctggctt tctaacgaaa 1980 
atttgtattt ctaaaattag agaatacatg ttccacgcat ttaaaattta tgttctttca 2040 
tgttttaaag ctcccaaatc cagctttgtg actggcatat tttagtttca aacagtaccc 2100 
cggcacaaag gtgggatggc acagtgaagg ccccccgccc tctactttgc atagtcttgt 2160 
ttctccaggg tgctcccagg aagcattcat tctgactttg ctcagcccag tgcatgcgtg 2220 
ctgccttgcc gccgtgctgc tgggtagctc tttcttggtc agatcaagtc ttcaacagat 2280 
ctccatgtga gacagttgcc aagtagatga ggtggtgccc atagtgcttt ctcgatactc 2340 
cttggggacc tgttgacacc tgcccatttc cagctgacat ttgtttttct gtcatctctg 2400 
atagatggga ta-gtgacaa catggtacgg acgccgttca gtgtcgcttt aataagcatg 2460
atgctgattt tacatcctgt gctgtatgac tgccatttgc tcacagtgtc accattgcta 2520 
aagctccgtg ctttacttac aaaacactaa aaccagtggt tagtgtttca cagtgatttt 2580 
aattttagag tcagttactg gcattcctaa agccatagag tactgagtca catccctgaa 2640 
gtacttttga aacagaattg tctcctactg tcccatgggt gtgccctgcc tgtctcctgg 2700 
ccccaatggg gccagctgta ccaggcagcc atagttgagc ctgatcattc ctgtcaccag 2760 
tttgacttga ttatataccc agaatggaat acattcttgg gcatctcagt tcctcagccc 2820 
tgatcctcat agacgccacc ctttcgatgg cttttgcggc gtcacttgta cctcagtgag 2880 
rcctgcgatt cttgagttag aggggacgac ttgtccagca ttgaggaaca tgtctcctcc 2940 
actgagactt aaatgatgat gcagggctgg aagaggctgg ctgctgacac tgcatcgrgg 3000 
ctgatgtcat tgstctccta gttctttgat ctaagaacct ttcatatgga aggcctgagg 3060 
ctccctcaga tcg-cccttg ccaagaaggc ctggcttagg tcattagtgc ccacagtagc 3120 
cttctggagt gcagcaagtt cctgcgtttg agacagaatg gttcagattt attttctaca 3180 
~ctgttgttg accccatgca ccctctcatt ttgccttcca gtctacgtag atgaaagatg 3240 
aaaggcagag ga^gcagaca gtcttctttg tgattgcttc tgttattctg ttgcatctac 3300 
cgagcccgtt trc::ccctgt ctgtgcatac agtatgttta taagtgaact tgttaaaata 3360 
wtaaatgatc ac-aacccta aaaaaaaaaa aaaaaaaaaa aaaaaaaa 3408 
<210> 21

<211> 878 

<212> DNA 

<213> Mus musculus 



<400> 21 

aaatctcagc tgtgcagcct tctagaacct catgtctaca aagtgacaaa ctttcctcag 60 

gcacaatcca tcatacagtg gatcaatcag tcagagtcag aggaagaact agtcaaaatt 120 

acctcatttt ctgaattcat taacacctta aagaaaaccc acaaatacct aatggaagag 180 

agtttcaaaa ctgagccccc agaaagagtg gaagaagcaa agagaatggc tacatatgaa 240

gtcaccacag ctctcagctc cttcttgaag cacctcagag aaacacagca gccagacatg 300 

cagctgttgc tactctccat tgctactggt gtaggctatc agttggtaaa cagtarcttt 360 

cagcatcttc tggggtgtga tgagttaaac ttcctcttgg atcaaacgga aaataacgaa 420 

cataaatacc aagaactgaa aaatatttgc aattacagag cccaggcatt cttgg-cgctc 480 

acagccctaa gagccacagt tgaaatcaca gatgtttcta cagaagagaa aggacaacgt 540

ttgacattaa tacaacaaca tatggggtca ctgttgtctg aagaagttgc acatgtcctc 600 

acaaaacatg gagaacatca tgactgggaa aggctggaga atcatttgag attactcatt 660 

gagggggact ataaagccac cacccattcc tracaaatgg atgaagtaaa aaaacaattg 720 

caaagtttnt gccatgaaaa gaaacagact tataaacaac aaggtaatga aaacagaaca 780 

aaagaaatga tagaaaatgg acatttcctg gacttactcc aacgtittagg cctagacaat 840

tactatccaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 878 



<210> 22 
<211> 731 
<212> DNA 

<213> Mus musculus 



<400> 22 

actagactct ggtagagaaa aaagtgcctc ggagtggctg ctgaacaacc tcatgactca 60 

ccagagtgtg gagcttttca naaaaactca cttntaaacg tcacgaaaca gctcattcgt 120 

tcntctaact tatttgtgat gcaagtggaa atagatgtnt atacagctct taaaaagtgg 180 

atgttccttc agctggtgcc ntcctggaat ggggtctttg aaaacagctt ttgacaaaaa 240 

cagatgtctg gttttccaag tggaaagaag actttgaaag gacgactttc cntgaaactg 300 

aacagggaaa accagntcgc gcccntgtnc cgacatttaa ggc-acagta cattatcagt 360 

gatctggctt ctgcaaggat cattgagcag gattctctgg tacc-tcaga atggctggcg 420 

gcagtgtata aacagcagtg gctggctatg ctacgggctg aacsagacag tgaagtggcg 480 

cctcaagaaa tcaataaaga agancttgag ggaaacagca tgaggtgtgg tcgaaagctt 540 

gccaaagatg gtgngtactg ctggcgctgg acacgcttca att.cggctt tgacctcctt 600 

gtgncttaca ccaatcgata catcattttc caacgccata cnczgaacca gccacgttat 660 

tggatctgtc agcttacacc tcgaaggagc atagcattta aatgcccttg gcttctttga 720 
cagtagtggg a 731 



<210> 23 
<211> 400 
<212> DNA 

<213> Mus musculus 



<400> 23 

gacatttaag gctacagtac attatcagtg atctggcttc tgcaaggatc attgagcagg 60 

attctctggt accttcagaa tggctggcgg cagtgtataa acagcagtgg ctggctatgc 120 

tacgggctga acaagacagt gaagtggggc ctcaagaaat caataaagaa gaacttgagg 180 

gaaacagcat gaggtgtggt cgaaagcttg ccaaagatgg tgag-actgc tggcgctgga 240 

cacgcttcaa tttcggcttt gacctccttg tgacttacac caa.cgatac atcattttcc 300 

aacgccatac nctgaaccag ccatgttatt ggatctgtca gcrtacacct cgaagganca 360 

tancatttaa acgcccttgg cttctttgac agtagcggga 400 

<210> 24 

<211> 2649 

<212> DNA 

<213> Mus musculus 



<400> 24 

tgagcgcaac gcaattaatg tgagttagct cattcattag gcaccccagg ctttacactt 60 
tatgcttccg gctcgtatgt tgtgtggaat cg^gagcgga taaraatttc acacaggaaa 120 
cagctatgac catgattc J ccaagctcta atacgactca ctatagggaa ac, .ggtacg 180
cctgcaggta ccggtccgga attcccgggt cgacccacgc gtccgacggc tgcgagaaga 240 
cgacagaagg gggcaccaca gaggacccct tcaacccaag gaaacaccca ggctrcgata 300 
ggaccatgct gcagaggtgg cagaaaaggg agatcagcaa ctttgaatac ctcatgtacc 360 
tcaacacact ggccggaagg acctacaatg actacatgca gtatcccgtg tttccctggg 420 
tcctcgctga ctacacctca gagatgttga acttgacgaa tcccaagact ttccgggatc 480 
tttctaagcc aatgggggct cagaccaagg aaaggaagtt gaagtttacc cagaggttta 540 

aagatgttga aaagattgaa ggagacatga ccgtgcagtg ccactactac acccactatt 600 

cctcagccat cattgtcgct tcctacttgg tccgaatgcc accattcacg caggccttct 660 

gctccttaca gggcggaagc tttgatgtgg ctgatagaat gttccacagt gtaaagagca 720 

cgtgggagtc tgcctccaaa gagaacatga gcgatgtcag ggagctgaca cctgaattct 780 

tctacctgcc cgagttttta accaactgta atgcagtgga gtttggctgc atgcaggatg 840

gaacgacact gggggatgtg cagcttcctc cctgggctga cggggatccg aggaaattca 900 

tcagcttgca cagacaggct ctggaaagtg acttcgtcag cagcaacctc caccactgga 960 

tagacctaat ttttgggtat aagcagcagg ggccggctgc tgtagaggca gtgaacactt 1020 

tccaccccta cttctacggt gatagaatag acctgggcag catcactgac ccgcxgatca 1080 

agagcaccat cctgggcttc atcagcaact ttggacaggt gcccaagcag atctrcacta 1140 

aaccccaccc atccagaaac accacaggga aaaacccagg gcccggaaag gatgcrtcca 1200 

cccctgtagg cc^cccaggc cactcacagt ccttcctcca cagcctgcca gcaccgagac 1260 

cctctcaggt cacagtcaaa gatatgtacc ttttctctct agggtcggaa tcccccaaag 1320 

gggccatcgg ccacatcgtc cctactgaga agtcaatcct ggcagtggag aagaacaagc 1380 

tgctgatgcc ccctctctgg aacaggacct tcagctgggg ctttgatgac ttcagrtgct 1440 

gcctggggag ctacggctct gacaagatcc tgatgacctt tgagaacctg gctgcctggg 1500 

gtccctgtct gtgcgctgta tgcccttccc ccacgatgat cgtcacatcc ggggccagcg 1560 

cagtggtgtg catctgggag ctgagcctgg tcaaaggtcg cccgagaggt ctgaaactcc 1620 

gacaggcctt gtatggacac actcaggcgg tcacatgtct gacagcctct gtcaccttca 1680 

gcctcctggt gagcggatcc caggatcgca cttgtatcct gtgggacctg gaccacctct 1740 

ctcgtgtggc ctgcctgcct gtccaccggg aaggcatctc agccattgcc atcagtgatg 1800 

tctcgggaac cactgtctcc tgtgccggag cccacttgtc cctgtggaat gtcaarggac 1860

agcccctggc cagtattacc acagcctggg gcccagaagg aaccataacg tgctgctgca 1920 

tagtagaggg gccagcgtgg gatgcaagcc acgtgatcat cacggggagt aaggacggaa 1980 

tggttcggat ttggaagaca gaggacgtga agatgcctgt tcccaggcag gcagtgatgg 2040 

aggagccctc cacggagccc ctaagcccca gaggtcacaa gtgggccaag aatcttgccc 2100 

rgagccgaga gctggatgtc agtgttgctc tgagtggcaa gcccagcaag gcaagncctg 2160 

ctgtgacagc tctggccatc actaggaacc agagcaagct cttggttggc gatgagaagg 2220 

gcgaatcttc tgctggtctg ctgatgggta ggagacaagg agctggaggg accgacctga 2280 

aagcctggag ccctggggtc ggcagcaaca ggctacaggc acaagatgat gtgtagcctg 2340 

ggctgcttaa ccagagcaag ttttgggggg gctccactcc acacagttct caaggagtcc 2400 

ctgatggttt gcaccgtgta ccctaaacat gtctgtagtc tatgggactt ctgtaagaag 2460

gatcttggta gacactgatg ctggaaactg acgctgcatg ggaaatttct acgtggctca 2520 

cttcaccaag gcttattgca ctgggaaaag aagagggggt gggtattggt tcatggaaac 2580 

caccccactg tctttatttt attaaaactc cattttccga aaaaaaaaaa aaaaaaaaaa 2640 

aaaaaaaaa 2649

<210> 25 

<211> 499 

<212> DNA 

<213> Mus musculus 

<400> 25 
cccattactt 
attactgtat 
aaagacagag 
cactggactc 
caacatgatt 
attttagagg 
ccaaggcaat 
^aatgtcgca 
a-gtgttctg 

<210> 26 
<211> 686 
<212> DNA 

<213> Mus musculus 
ccacctccca tgaggactgc acctgcggca gcgattttat gtgaacttgg 60

ggaaatggag gagtgagaaa gtgtgggaat cataaaagaa ggcagaacca 120 

ggatcttggg ccatgagtca cagcaaggaa agcctcccac caaacatctg 180 

ctacaggaac aagaaatgaa ctatcgtgtt cagcccctga gacttgggct 240 

ttcacattgt ctaccctacc taatagagca gagatgtaaa tattactctt 300 

tgcgatgcct cagctgcaat gggtgagaac tactcctcat ttatcctctt 360 

aaaagagaat ggaccaaaga cagtctgtca tcacatctag tcaaaagagc 420 

gtacaactct tcaaaagaaa aagaaaaaac aagaaaaaag taataaacag 480 
ctrgaaaaa 499 
<400> 26

cggggtcgcg cgggaattaa 
gataagaagt t"~ ctgaac 
ggagagctca tastcgtaga 
tcagctcttt ccagacaatg 
tagaaagaaa c> . -ggtcaa 
cagttatgac tra-tccaaa 
aaactcgtcg acctccagga 
gtgggactag aaaaaatggc 
ataaagtccn nnganccnag 
tcccttatga aargaaaatt 
aaggtttttg aaattttcaa 
aaaaaatttt t-gggccctt 



gtgagtccat tcttgcacac 
cctttgggaa gagacttgct 
ggacataatg atggtgaaga 
gaccaaatgg tgtctaatat 
ctttcagtgg atccaaatgg 
ataggagatg aaccgccaaa 
agaataaagg aaaccaggaa 
tattggtcnt catatccatg 
aaagactggg agatgaaaaa 
gatgccccct gccttttgaa 
accccggaac aacccaatcc 
aaaaan 



cgagaaaata tgcgacagat 60 
cagtatctct gatggtagan 120 
ttctttgact catacagatg 180 
gagaaactat atgcagaaat 240 
acattcattt tgttcttcct 300 
ggtttttcat gcctcaactc 360 
agcnatgaga gattctgaca 420 
aaccgagctc ntgttcntta 480 
aggtccaccc aggaanttca 540 
tgaaggaatt ggccaaattn 600 
tnggaaaacc cctnnaaatg 660 
686 



<210> 27 
<211> 651 
<212> DNA 

<213> Mus musculus 



<400> 27 

caaaannacn tgzgtttagc aaataaaaac tgcccagtgg gacaancgtc gccgggaatc 60 
gctgtcagta cccccgantt canaaatgcc tggctgttgg ganggncaaa gaaatggnnc 120 
gcacagacag nr.naaaaggc cggagaagnc gnntgcccnc gaaaccgaag aaccacanga 180 
nccctctccc cc^tcgcccc cgttccangc gaacctgacn ancaaangaa nggagangan 240
acccagcana nccngcaant ctatganctc ctgactggct ccanggaaan catcccgggc 300 
ngggcagaaa aaatccctgg gcnncgcaga actgccccaa gccgaccaag aactgcnnnn 360 
tgaancagcn ntcrnagaac tgnttgtccn nccannagca nacagaagat gaacatcgac 420 
atntctgcct tcrcctgcat ngcngccctg ggctatggnc acnnaaaaac acgg.cncngg 480 
gaaccggaaa ar.ggaanaan gncnnccnga tngntgantg ncgcggagac acgngacttt 540
cnacnanggg gcgggnannc gccccccttn nttgttccna ctgggggggg gaacnncccn 600 
aaattcgngc ccctnngggc ccnagggggg ggggggnccn ttttctttct n 651 



<210> 28 
<211> 683 
<212> DNA 

<213> Mus musculus 



<400> 28 

gggtctgttt ggactgcaga agcaaggggg tgatgtagcc catccttccc tttggagatg 60 

ctgagggtgt ttcctcctgc acccacagcc agggggatgc cactcctccc tccggcttga 120 

cctgtttctc tgccgctacc tccctccccg tctcattccg ttgtctgtgg atggtcattg 180 

cagtttaaga gcagaacaga tcttttactt tggccgcttg aaaagctagt gtacctcctc 240 

tcagtgtttt ggactccatc tctcntcctc cantaccttg cttcttactg ataattttgc 300 

tggaattcct aac-tttcaa tgacattttt tttaactact atattgattg tcctttaaaa 360 

aagaaaagtg catatttatc caaaatgtgt atttcttata cgcttttctt tgttatacca 420 

tttcctcagc ttatctcttt tatatttgta gganaaactc ccatgttatg gaatcccact 480 

gtatgattta taaacagaca atatgtngag tgccttttgc aanaaaaagg gtgtgtttga 540 

aatcatccgg ar.tcagccan gaaactgtct ccaaaggaaa cnccnacctc tctgttcctt 600 

gctgttntgc tcarcatccg ccanaaggtg cttccccccg aattttggtt tggttntgtt 660 
tnccgaaaaa ttr~ttccgn ttn 683 

<210> 29 
<211> 624 
<212> DNA 

<213> Mus musculus 



<400> 29 

aacgcacctc c~-gtattcc cactttcgta ntcatttcgg ttctgatctt gtcaaaccca 60
gcctgaccgc t-crgacgcc gggatggcct cgttactana cttttctttt taaggaantg 120
ctgttttttt trganggttt tcaaaacatt ttgaaaagca tttacttttt tgaccacgan 180
ccatganttt tcaaaaaaat ccggggttgt gtgggttttt ggtttttgtt ttaatttttg 240
gcLgcgttgc c-_rtttttt taatggggtt ggccccatga aatgggtgcc ccactcactt 300
ctctganatc gaacngactg tgaatccgct ctttgtcnga anctgaacaa nctgtggctt 360
ctttccaact ccgtgtg^ „ tttctgaatg ttatgtggta agaccccgcg gc, ^-ggcan 420

caactgccct ggaaccccan cccctgcntc catctgtngc tgntgcgccc cacantaaaa 480 

cnctcanacn ccctgaaaa gttcttgaaa aaanttaatt anattgtccc nnttttactg 540 

ggaaaaaatt anccntactc cnnccaattt tnttcttncn antgggctga aagaaatcct 600 

tgttccccan t-ttcctttg ggcn 624 



<210> 30 

<211> 641 

<212> DNA 

<213> Mus abbotti 



<400> 30 

atcccagacc aggcaacaag gtcctttgag gtctataatg aaagatctgc attctgatga 60
caatgaggag gaatcagatg aagtggagga taacgacaat gactctgaaa tggagaggcc 120
tgtnaataga ggangcagcc gaagtcgcag agttagctta agtgatggca gcgatagtga 180
aagcagttct gcz tcttcac ccctacatca cgaacctcca ccacccttac taaaaaccaa 240
caacaaccag attcttgaag tgaaaagtcc nataaagcaa agcaaatcag ataagcaaat 300
aaagaatggt gaatgtgaca aggcatacct agacgaactg gtngagcttc ccagaangtt 360
natgacattg agagaaagac acattctgca gcagatcgtg aaccttatng aagaaactgg 420
gacactttca tatcaccaac acaacatttg attttgatct ttgctcgctg gacaaaacca 480
cagtccgtta actacagagt tcctggaaac atctggaaca tcctgaagat ataaccactg 540
gatgcctcna gaactaatgt gtttnnnnnn nnnnngggtt nnnnnntttg ggtgtgaatt 600
rttgttcccg trgtttaaat gaaaaccccc cnaatgatgc 641



<210> 31 

<211> 400 

<212> DNA 

<213> Mus musculus 



<400> 31 

gcgagccgac ggagnggctc tacggtggat agcgtgttcc ggaacctatc cctcgaatta 60 

gccgagtcag gcagagaggg ggcggggngt gcttccgccc ttgctaggag gggctgcatt 120 

gcaggggaga cccagcggca gattctgtca cagacgaggg agaaggcgtg aggagacaaa 180 

gccgtcacat ccgcgacagc ttccttcagc agcctcctcc tctccagtcc agagccgacc 240 

cccgagcccc tgaggcatcc cttccgtctt cggaacaccc tagtcattca ttgctaacag 300 

gggatcatga gggaccctgt aagtagccag tacagctcct ttcttttctg gangatgccc 360 

atcccacaac tggatctgtc ngagctggaa ggcctgggcc 400 



<210> 32 

<211> 400 

<212> DNA 

<213> Mus musculus 



<400> 32 

acggctgcga gaagacgaca gaagggaagc taagggctgg tcgctgcgtc tgaacgccgg 60 

gttggacgaa ccgctgtgcg cccttggcgg acgtnagcgg aaagaagatg gcggrgcagg 120 

tggtgcaagc tgtgcangcg gttcatcttg antctgacgc tttcctantt tgtctcaacc 180 

atgctctgag cacagaaaan gangaagtga tgggtctgtg tataggggan ttgaatgatg 240 

acataaggag tgactccnaa tttacataca ctggaacgga aatgcgcaca gtccnagaaa 300 

agatggatac catcagaatt gttcatatcc attctgtcat catcttgcgg cgttctgaca 360 

aganaaanga ccgtgtnnaa atttctccan ancanctgtc 400 



<210> 33 
<211> 400 
<212> DNA 

<213> Mus musculus 



<400> 33 

gatggcggtc tccacaggag ttaaagttcc tcgtaatttt cgcttgttgg aagaacttga 60

agaaggacaa aaaggagtag gtgatggtac tgttagctgg ggccttgaag atgatgaaga 120

catgacactt acaaggtgga caggcatgat tattgggcca ccaaggacaa actatgaaaa 180

cagaatatat agcctgaaag tagaatgtgg atctaaatac ccagaagctc ctccatcagt 240

-agatttgta acaaaaatta atatgaatgg gatcaataat tccagtggaa tggtggatgc 300

acggagcata ccagtattag caaaatggca aaattcctat agcattaaag tcatacttca 360
aganctaaga ctcttatv . gtccaaagaa aaiatgaanc 400



<210> 34 

<211> 400 

<212> DNA 

<213> Mus musculus 



<400> 34

acggctgcga gaagacgaca gaagggggag tctgcgtc ccagggactc cagtacacca 60
ccatggcgga ttctgagcgt ctctcggccc ggctgctg gttagcctgc accagcrtct 120
cgcgcaccaa aaagggaatt ctcctgtttg ganattat actgtgcctg gtgaccttga 180
tttgcttcag tgcatctaca acatcggcct ac-cctccct gtcggtgatt gagatgatct 240
gtgctgctgt cttacttgtc ttctacacgt gtgacctgca ctccaagata tcattcatca 300
actggccttg gactgacttc ttcagatccc ataacaac catcctgtac ctgarcacct 360
ccattgttgt ccttgtagaa agaagaagca rccagagt 400



<210> 35 

<211> 400 

<212> DNA 

<213> Mus musculus 



<400> 35 

tacggctgcg agaagacgac agaaggggct ccgccttctt ccccacctcc ggctgccggc 60
gacaccggtc tttgcagtcc gggaccccgt gcgatcgtta gcccggtgca cgatgccccc 120
gaaaaaggga ggagatggaa ttaaaccgcc tccaattatt ggaagatttg gaaccrcact 180
gaaaattggt atcgttggat tgccaaatgt cgggaaatct accttcttca atgtattaac 240
caatagtcag gcttcancag aaaacttccc atrctgcact attgatccta atgagagcag 300
agtgcctgtg ccagatgaga ggttcgactt tctttgccag taccataaac cancaagcaa 360
gatcctgctt tcctaaatgt aatggatatt gctggccttg 400



<210> 36 

<211> 400 

<212> DNA 

<213> Mus musculus 



<400> 36 

acggctgcga gaagacgacn gaaggggtcc t:rctcctgc gccgcgtttc agtcccrctt 60
gagttgggct gtcgtcnant cgcggcgana c;ggcgcacc gcagccatga cagaacctga 120
tgtnaatccg aangcctatc cccccncana z:;ccacctc accaagaanc tgctggacct 180
tgttcnacag tcttgttnct acaancagct c:r.gaaagga ccnacgaacc nccaaaaccc 240
tcaacagaag catctctgan ttcattgtga t:-ngcanan ctgaaccttg gagatctcct 300
gcacctccct ctnctgtcga aaaaaanaat g::cctacct nttT:gtncnn ccaacaggct 360
ttangaaagg cctntgggtt tccagccntc a:scctgtct 400



<210> 37 

<211> 394 

<212> DNA 

<213> Mus musculus 



<400> 37 

gaaaggcgcc tagataccgc gatacttgcg cg^atcctgg agctggactc tcctgrtctt 60
gccacatttg actgaagaag gacaacgggg tcaaggtttg aggactgaca gattc-anac 120
ccaggcttcc tcngccccca naaagccacc anacctcaca tggaaccaaa ggccccatgc 180
ccanccgccg tcccctcaga gganangaaa ttccgtgttc ttgtr.ggcgt cactggcagc 240
gtggccgctc tgaagctgcc tctcctgnta tc-annctgt tggacgttcc tgngargtgg 300
aagcgccgtt ctgacccggt tctccacatt gacctgcgga tgtgggctga cctcatgcta 360
gtggctcccc tcnatgcnna cactctgngg aaat 394

<210> 38
<211> 315
<212> DNA

<213> Mus musculus

<400> 38
acggctgcga gaagacg, . gaangggggc aggatggctc ggtggtacag t\. .agatca 60

anangcatac cccactgagc aagctgatga agcctactgt gagaggcagg gcttgtcnat 120 

nangcagatt cgattccggt ttgatggaca accaatcaat gaaacagaca ctccacccca 180 

nctggagatg gaggatgagg acaccnttga tgtattccag cagcagacag gaggatcanc 240 

ctcccgaggg agcgtcccac acccaaccgt tgtcctgacc tgtgctattg agcagrgacc 300 
atgctnccac accca 315 

<210> 39 
<211> 400 
<212> DNA 

<213> Mus musculus 
<400> 39 

acggctgcga gaagacgaca gaaggggaga gcatcatggc gctaagcggt cgactggcat 60 
tggccgcgct cagactgtgg ggtccgggag gatgtatctg anctgcttcg gaccanancc 120 
tgcagtaggg tggtggtcat ggtgggggcc ggcatcanca cacccagtgg catcccggac 180 
ttcagatccc caggggagcg gcctctacag caaccttcag cagtatgaca tcccctaccc 240
tgaagccatc tttgaacttg gctttttctt tcacaacccc aagccctttt tcatgttggc 300 
caaggactgt nccctgngca ctacaggcca atgtcactca ctacttcctg aagctcctcc 360 
acgacangga ccgcttctgc gctctntaca canaaatcga 400

<210> 40 
<211> 349 
<212> DNA 

<213> Mus musculus 
<400> 40 

ggccatcaan cgtgctgttg gtcatggccg actcctcaaa gatggacccc tctaccgcct 60 
caacactaag gcagccagtg ctgaagggaa agagggctgt gagtcgctct cgtgtttacc 120 
tcccgtgtca ctgcttccac acganaaaga caagccggtt gctgaaccaa tccccatctg 180 
tagcttctgt cttggtacaa aagaacagaa ccgggagaag cagcccgagg aactcgtctc 240 
ctgcgcggac tgcggcaata ncggtcatcc atcgtgttta aagttctccc canagctnac 300 
agtgagagtg aagccttacn gtggcttnca ttgantgtaa aacatcact 349

<210> 41 
<211> 397 
<212> DNA 

<213> Mus musculus 
<400> 41 

cccacgcgtc cgatctcctc cagggccacc aagcacctct gaagagccat gttccaagct 60 
gccggagccg cccaggccac cccctctcat gaagccaaag gcagcagtgg cagcagcacg 120 
gtacagcggt ctaagtcctt tagcttgcgg gctcaggtga aggagacctg tgcagcctgc 180 
cagaagactg tgtacccgat ggagcggctg gtggcagaca agctcatttt ccacaactct 240 
tgtttctgtt gcaaacactg ccacaccaaa ctcagcctgg gcagttatgc tgcaatgcac 300 
ggtgaatttt actgcagacc tcactttcag cagctgttta agagtaaagg caactacgat 360 
gaaggttttc gtaaacagca caaggactct gggccac 397 

<210> 42 
<211> 158
<212> PRT 

<213> Mus musculus 
<400> 42 

Met Phe Gln Ala Ala Gly Ala Ala Gln Ala Thr Pro Ser His Glu Ala
1 5 10 15

Lys Gly Ser Ser Gly Ser Ser Thr Val Gln Arg Ser Lys Ser Phe Ser
20 25 30

Leu Arg Ala Gln Val Lys Glu Thr Cys Ala Ala Cys Gln Lys Thr Val
35 40 45

Tyr Pro Met Glu Arg Leu Val Ala Asp Lys Leu Ile Phe His Asn Ser
50 55 60

Cys Phe Cys Cys Lys His Cys His Thr Lys Leu Ser Leu Gly Ser Tyr
65 70 75 80

Ala Ala Met His Gly Glu Phe Tyr Cys Arg Pro His Phe Gln Gln Leu
85 90 95

Phe Lys Ser Lys Gly Asn Tyr Asp Glu Gly Phe Gly Arg Lys Gln His
100 105 110

Lys Glu Leu Trp Ala His Lys Glu Val Asp Ser Gly Thr Lys Thr Ala
115 120 125

Asp Pro Phe Asn Thr His Ser Leu Pro Ala His Gly Leu Pro Leu Gly
130 135 140

Ser Gly Lys Glu Ile Asn Pro Gly Ala Arg Gly Gly Arg Gly
145 150 155



<210> 43 

<211> 1360 

<212> DNA 

<213> Mus musculus 

<400> 43 

agctctagtc cccagagatg tcgccactac tgctgctgct gctgtgcctg ctgctgggga 60 
atttggagcc tgaggaggcc aaactgatcc gtgtccctct tcaacgaatc caccttggac 120 
acagaatctt aaacccactg aatggatggg aacagctggc agagctttct aggacctcca 180 
cctctggtgg caacccctcc tttgtgcctc tctccaagtt catgaacacc cagtattttg 240 
gaactattgg ttzgggaacg cctcctcaga atttcaccgt tgtctttgac acgggttctt 300 
ccaacttgtg ggctccgtcc acgagatgtc atttcttcag tttggcatgc tggtttcacc 360 
atcgctttaa tcccaaggcc tccagctcct tcaggcccaa tgggaccaag tttgccattc 420
agtatgggac cgggcggctg agcggaatcc tgagccagga caatctgact atcgggggga 480
tccacgatgc ttttgtgaca tttggagagg ctctgtggga gcccagcctg atctttgctt 540 
tagcccactt tgatgggatc ctgggcctcg gcttccccac tctggctgtg ggcggagttc 600 
agcctccgct ggatgcgatg gtggagcaag ggctgctgga gaaacccgtc ttctcctttt 660 
acctcaacag ggattctgaa gggtctgatg ggggagagct ggtcctaggg ggctcagacc 720 
ccgctcacta cgtacctccc ctcaccttca taccagtcac catccctgcc tactggcagg 780 
tccacatgga gagtgtgaag gtcggcacag ggctgagcct ctgtgcccag ggctgcagtg 840 
ccatcctaga cacaggcaca tccctcatca caggacctag tgaggagatc cgggccttga 900 
ataaagccat tgggggatat cccttcctga atgggcagta cttcattcag tgttccaaga 960 
cgccaacgct tccccctgtc tccttccacc ttggtggagt ctggtttaac ctcacaggcc 1020 
aggactatgt caccaagatt cttcagagcg atgttggcct ctgcctgttg ggcttccaag 1080 
ccttggatat ccccaagcct gcgggacccc tctggatcct tggggacgtc tttttggggc 1140 
cctatgtggc tgtctttgac cgtggggaca aaaacgtcgg cccgcgcgtg ggactggcgc 1200 
gtgctcagtc tcgttcaaca gaccgggcag aaagaaggac tacgcaggcg cagttcttca 1260 
aaagacgccc tggttagggt acaagctcac cgggccacag cagctatgct tctttccaat 1320 
caaacaaact aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1360 

<210> 44 
<211> 419 
<212> PRT 

<213> Mus musculus 
<400> 44 

Met Ser Pro Leu Leu Leu Leu Leu Leu Cys Leu Leu Leu Gly Asn Leu
1 5 10 15

Glu Pro Glu Glu Ala Lys Leu Ile Arg Val Pro Leu Gln Arg Ile His
20 25 30

Leu Gly His Arg Ile Leu Asn Pro Leu Asn Gly Trp Glu Gln Leu Ala
35 40 45

Glu Leu Ser Arg Thr Ser Thr Ser Gly Gly Asn Pro Ser Phe Val Pro
50 55 60

Leu Ser Lys Phe Met Asn Thr Gln Tyr Phe Gly Thr Ile Gly Leu Gly
65 70 75 80

Thr Pro Pro Gln Asn Phe Thr Val Val Phe Asp Thr Gly Ser Ser Asn
85 90 95

Leu Trp Val Pro Ser Thr Arg Cys His Phe Phe Ser Leu Ala Cys Trp
100 105 110

Phe His His Arg Phe Asn Pro Lys Ala Ser Ser Ser Phe Arg Pro Asn
115 120 125

Gly Thr Lys Phe Ala Ile Gln Tyr Gly Thr Gly Arg Leu Ser Gly Ile
130 135 140

Leu Ser Gln Asp Asn Leu Thr Ile Gly Gly Ile His Asp Ala Phe Val
145 150 155 160

Thr Phe Gly Glu Ala Leu Trp Glu Pro Ser Leu Ile Phe Ala Leu Ala
165 170 175

His Phe Asp Gly Ile Leu Gly Leu Gly Phe Pro Thr Leu Ala Val Gly
180 185 190

Gly Val Gln Pro Pro Leu Asp Ala Met Val Glu Gln Gly Leu Leu Glu
195 200 205

Lys Pro Val Phe Ser Phe Tyr Leu Asn Arg Asp Ser Glu Gly Ser Asp
210 215 220

Gly Gly Glu Leu Val Leu Gly Gly Ser Asp Pro Ala His Tyr Val Pro
225 230 235 240

Pro Leu Thr Phe Ile Pro Val Thr Ile Pro Ala Tyr Trp Gln Val His
245 250 255

Met Glu Ser Val Lys Val Gly Thr Gly Leu Ser Leu Cys Ala Gln Gly
260 265 270

Cys Ser Ala Ile Leu Asp Thr Gly Thr Ser Leu Ile Thr Gly Pro Ser
275 280 285

Glu Glu Ile Arg Ala Leu Asn Lys Ala Ile Gly Gly Tyr Pro Phe Leu
290 295 300

Asn Gly Gln Tyr Phe Ile Gln Cys Ser Lys Thr Pro Thr Leu Pro Pro
305 310 315 320

Val Ser Phe His Leu Gly Gly Val Trp Phe Asn Leu Thr Gly Gln Asp
325 330 335

Tyr Val Ile Lys Ile Leu Gln Ser Asp Val Gly Leu Cys Leu Leu Gly
340 345 350

Phe Gln Ala Leu Asp Ile Pro Lys Pro Ala Gly Pro Leu Trp Ile Leu
355 360 365

Gly Asp Val Phe Leu Gly Pro Tyr Val Ala Val Phe Asp Arg Gly Asp
370 375 380

Lys Asn Val Gly Pro Arg Val Gly Leu Ala Arg Ala Gln Ser Arg Ser
385 390 395 400

Thr Asp Arg Ala Glu Arg Arg Thr Thr Gln Ala Gln Phe Phe Lys Arg
405 410 415

Arg Pro Gly



<210> 45 

<211> 3338 

<212> DNA 

<213> Mus musculus 

<400> 45 

cggtgacgca ggaccaggac tcgcgcgtcc agcggagaag caggagaagc cggcgacctt 60
gcgctctcag cctgatccct gtcttggcgg cctgaacatt cgcagctgga gagacggcgt 120
tcgtgaagag tcgatggtta cttcggcaga gcaccattct gaaacgctgg aagaagaatt 180
ggttcgacct gtggtcagac ggtcacctga tctactacga tgatcagact cggcagagca 240
tagaggataa ggtccacatg cccgtggact gcatcaatat ccgcacgggg catgagtgcc 300
gggacatcca gcctccagat gggaagccca gagactgtct gctgcagatc gtttgccgag 360
acgggaagac carcagtctc tgtgcagaga gcacagacga ttgcctggca tggaagttta 420
cactgcagga ttccagaaca aacacagctt acgttggttc agcaatcctg tctgaagaga 480
ctgcagtggc cgcgtccccg cctccctacg caacctatgc tacaccnacc cctgaggtct 540
acggctatgg tccatacagc ggcgcatacc ccgcaggaac tcaagttgtc tatgccgcca 600
acgggcaggc atatgcagtg ccataccagt acccgtatgc aggagtttat ggacaacagc 660
ctgccaacca agtcatcatc cgcgagcggt accgagacaa tgacagtgac ctggctctgg 720
gcatgctcgc cggggcagcc accggcatgg ccctgggctc tctgttctgg gtcttctaga 780
gccttcaaca ttttctgtgc atagcttctg ttagtcctgt gtgcagtaat ttgatttgca 840
gggcatttct gtttgtgaca agtgtctttc ataataattt aaatagttct tttgaaggtg 900
gtaatctaat aattgtgact gacctgcatg gtaccacaaa gaaagcccga ggtatgctgt 960
gagtgagagc ctgagtcctt ccgggtacta gcttgcacca agtctttctt agggactttt 1020
ggatggcttt atgtaaacac acccagttaa atgggcaatt tccgtccagt taggtgcagt 1080
gttgaattaa gggatggctt tccttgctat gccaatacta atactgctga tggaggaaga 1140
tgtgtgcaag tgtggtgagg agagtcacag cttctttaac tgtggattct cttctagacc 1200
cctgctgcgt gttaccctag gagctgtggg ctggtggctc ccgcaagact atggrgtgag 1260
gaccctgtaa cgcacctctt ggagcactta ggtacccctt gaagctccta ggtaccacca 1320
gcaggattgg ctgctcagga tgcagagggc caccccctcc ctttaaaaat tacgcrccag 1380
taatctgccc agttttattt tcttgttatt cttctgtttg cttttcctgg ggatgattgg 1440
cattagtctg gagttaggaa ttgattcgag tgccggtggg tggaggcatg cagggagctg 1500
tccagcgacc tgctctcagt gtttgtttta ggtatattga ttgccagctc aggctgcaga 1560
gagcctatag agactatttt tctacttgta aagaaagtat ggtgagggga attgaggaga 1620
gccttgtttg aatgttcctg cctcaggcct cccggggccc acactgcgtg gtcctgggga 1680
ggcttttcct cagcagtgga agggaggccc cgtggctgtc cagagtctca ggttrtgagt 1740
gagaaatggg atcgggtaga gcatctcagg gatttgttct aatccctcat gttatgggga 1800
tccagccgtg ttctcagtcc agacccgctc acctcagaag agcttaaaac atttctggtc 1860
cccaaatgtg tggcactctg agaagctcac aatctggctt tctaacgaaa atttgtattt 1920
ctaaaattag agaatacatg ttccacgcat ttaaaattta tgttctttca tgttttaaag 1980
ctcccaaatc cagctttgtg actggcatat tttagtttca aacagtaccc cggcacaaag 2040
gtgggatggc acagtgaagg ccccccgccc tctactttgc atagtcttgt ttctccaggg 2100
tgctcccagg aagcattcat tctgactttg ctcagcccag tgcatgcgtg ctgccttgcc 2160
gccgtgctgc tgggtagctc tttcttggtc agatcaagtc ttcaacagat ctccatgtga 2220
gacagttgcc aagtagatga ggtggtgccc atagtgcttt ctcgatactc cttggggacc 2280
tgttgacacc tgcccatttc cagctgacat ttgtttttct gtcatctctg atagatggga 2340
tatgtgacaa catggtacgg acgccgttca gtgtcgcttt aataagcatg atgctgattt 2400
tacatcctgt gctgtatgac tgccatttgc tcacagtgtc accattgcta aagctccgtg 2460
ctttacttac aaaacactaa aaccagtggt tagtgtttca cagtgatttt aattttagag 2520
ttagttactg gcattcctaa agccatagag tactgagtca catccctgaa gtacttttga 2580
aacagaattg tctcctactg tcccatgggt gtgccctgcc tgtcccctgg ccccaatggg 2640
gctagctgta ccaggcagcc atagttgagc ccgatcattc ctgrcaccag tttgacttga 2700
ttatataccc agaatggaat acartcttgg gcatctcagt tcc-cagccc tgatcctcat 2760
agacgccacc ctttcgatgg cttttgcggc gtcacttgta ccicagtgag tccrgcgatt 2820
cttgagttag aggggac-^ ttgtccagca ttgaggaaca tgtctcctcc at .agactt 2880
aaatgatgat gcagggctgg aagaggctgg ctgctgacac tgcatcgtgg ctgatgtcat 2940
tgctctccta gtcctttgat ttaagaacct ttcatatgga aggcctgagg ctccctcaga 3000 
tcgtcccttg ccaagaaggc ctggcttagg tcattagtgc ccacagtagc cttctggagt 3060 
gtagcaagtt cctgcgtttg agacagaatg gttcagattt attttctaca tctgttgttg 3120 
accccatgca ccc^ctcatt ttgccttcca gtctacgtag atgaaagatg aaaggcagag 3180 
gatgcagaca gtcttctttg tgattgcttc tgttattctg ttgcatctac cgagcccgtt 3240 
ttctccctgt ctgcgcatac agtatgttta taagtgaact tgttaaaata ttaaatgatc 3300 
actaacccta aaaaaaaaaa aaaaaaaaaa aaaaaaaa 3338 

<210> 46 
<211> 221 
<212> PRT 

<213> Mus musculus 
<400> 46 

Met Ala Phe Val Lys Ser Gly Trp Leu Leu Arg Gln Ser Thr Ile Leu
1 5 10 15

Lys Arg Trp Lys Lys Asn Trp Phe Asp Leu Trp Ser Asp Gly His Leu
20 25 30

Ile Tyr Tyr Asp Asp Gln Thr Arg Gln Ser Ile Glu Asp Lys Val His
35 40 45

Met Pro Val Asp Cys Ile Asn Ile Arg Thr Gly His Glu Cys Arg Asp
50 55 60

Ile Gln Pro Pro Asp Gly Lys Pro Arg Asp Cys Leu Leu Gln Ile Val
65 70 75 80

Cys Arg Asp Gly Lys Thr Ile Ser Leu Cys Ala Glu Ser Thr Asp Asp
85 90 95

Cys Leu Ala Trp Lys Phe Thr Leu Gln Asp Ser Arg Thr Asn Thr Ala
100 105 110

Tyr Val Gly Ser Ala Ile Leu Ser Glu Glu Thr Ala Val Ala Ala Ser
115 120 125

Pro Pro Pro Tyr Ala Thr Tyr Ala Thr Xaa Thr Pro Glu Val Tyr Gly
130 135 140

Tyr Gly Pro Tyr Ser Gly Ala Tyr Pro Ala Gly Thr Gln Val Val Tyr
145 150 155 160

Ala Ala Asn Gly Gln Ala Tyr Ala Val Pro Tyr Gln Tyr Pro Tyr Ala
165 170 175

Gly Val Tyr Gly Gln Gln Pro Ala Asn Gln Val Ile Ile Arg Glu Arg
180 185 190

Tyr Arg Asp Asn Asp Ser Asp Leu Ala Leu Gly Met Leu Ala Gly Ala
195 200 205

Ala Thr Gly Met Ala Leu Gly Ser Leu Phe Trp Val Phe
210 215 220



<210> 47 

<211> 2396 

<212> DNA 

<213> Mus musculus 
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<400> 47 

caccacagag gaccccttca acctaaggaa acacccaggc ttcgatagga ccatgctgca 60 
gaggtggcag aaaagggaga tcagcaactt tgaatacctc atgtacctca acacactggc 120 
cggaaggacc tacaatgact acatgcagta tcccgtgttr ccctgggtcc ccgctgacta 180 
sacctcagag atgttgaact tgacgaatcc caagactt-c cgggatcttt ctaagccaat 240 
gggggctcag accaaggaaa ggaagttgaa gtttacccag aggtttaaag atgttgaaaa 300 
gattgaagga gacatgaccg tgcagtgcca ctactacacc cactattcct cagccatcat 360 
tgtcgcttcc tacttggtcc gaatgccacc attcacgcag gccttctgct ccttacaggg 420
cggaagcttt gatgtggctg atagaatgtt ccacagtgta aagagcacgt gggagtctgc 480
ctccaaagag aacatgagcg atgtcaggga gctgacacct gaattcttct acctgcccga 540 
gtttttaacc aactgtaatg cagtggagtt tggctgcarg caggatggaa cgacactggg 600 
ggatgtgcag cttcctccct gggctgacgg ggatccgagg aaattcatca gcttgcacag 660 
acaggctctg gaaagtgact tcgtcagcag caacctccac cactggatag acctaatttt 720 
tgggtataag cagcaggggc cggctgctgt agaggcagtg aacactttcc acccctactt 780 
ctacggtgat agaatagacc tgggcagcat cactgacccg ctgatcaaga gcaccatcct 840 
gggcttcatc agcaactttg gacaggtgcc caagcagatc ttcactaaac cccacccatc 900 
cagaaacacc acagggaaaa acccagggcc tggaaaggat gcttccaccc ctgtaggcct 960 
cccaggccac tcacagtcct tcctccacag cctgccagca ctgagaccct ctcaggtcac 1020 
agtcaaagat atgtaccttt tctctctagg gtcggaatcc cccaaagggg ccatcggcca 1080 
catcgtccct actgagaagt caatcctggc agtggagaag aacaagctgc tgatgccccc 1140
tctctggaac aggaccttca gctggggctt tgatgacttc agttgctgcc tggggagcta 1200 
cggctctgac aagatcctga tgacctttga gaacctggct gcctggggtc cctgtctgtg 1260 
cgctgtatgc ccttccccca cgatgatcgt cacatccggg gccagcgcag tggtgtgcat 1320 
ctgggagctg agcctggtca aaggtcgccc gagaggtctg aaactccgac aggccttgta 1380 
tggacacact caggcggtca catgtctgac agcctctgtc accttcagcc tcctggtgag 1440
cggatcccag gatcgcactt gtatcctgtg ggacctggac cacctctctc gtgtggcctg 1500 
cctgcctgtc caccgggaag gcatctcagc cattgccatc agtgatgtct cgggaaccat 1560 
tgtctcctgt gccggagccc acttgtccct gtggaatgtc aatggacagc ccctggccag 1620 
tattaccaca gcctggggcc cagaaggaac cataacgtgc tgctgcatag tagaggggcc 1680 
agcgtgggat gcaagccacg tgatcatcac ggggagtaag gacggaatgg ttcggatttg 1740 
gaagacagag gacgtgaaga tgcctgttcc caggcaggca gtgatggagg agccctccac 1800 
ggagccccta agccccagag gtcacaagtg ggccaagaat cttgccctga gccgagagct 1860 
ggatgtcagt gttgctctga gtggcaagcc cagcaaggca agtcctgctg tgacagctct 1920 
ggccatcact aggaaccaga gcaagctctt ggttggcgat gagaagggcg aatcttctgc 1980 
tggtctgctg atgggtagga gacaaggagc tggagggacc gacctgaaag cctggagccc 2040
tggggtcggc agcaacaggc tacaggcaca agatgatgtg tagcctgggc tgcttaacca 2100 
gagcaagttt tgggggggct ccactccaca cagttctcaa ggagtccctg atggtttgca 2160 
ccgtgtaccc taaacatgtc tgtagtctat gggacttctg taagaaggat cttggtagac 2220 
actgatgctg gaaactgacg ctgcatggga aatttctacg tggctcactt caccaaggct 2280 
tattgcactg ggaaaagaag agggggtggg tattggttca tggaaaccac cccaccgtct 2340
ttattttatt aaaactccat tttccgaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 2396 

<210> 48 
<211> 693 
<212> PRT 

<213> Mus musculus 
<400> 48 

Thr Thr Glu Asp Pro Phe Asn Leu Arg Lys His Pro Gly Phe Asp Arg
1 5 10 15

Thr Met Leu Gln Arg Trp Gln Lys Arg Glu Ile Ser Asn Phe Glu Tyr
20 25 30

Leu Met Tyr Leu Asn Thr Leu Ala Gly Arg Thr Tyr Asn Asp Tyr Met
35 40 45

Gln Tyr Pro Val Phe Pro Trp Val Leu Ala Asp Tyr Thr Ser Glu Met
50 55 60

Leu Asn Leu Thr Asn Pro Lys Thr Phe Arg Asp Leu Ser Lys Pro Met
65 70 75 80

Gly Ala Gln Thr Lys Glu Arg Lys Leu Lys Phe Thr Gln Arg Phe Lys
85 90 95

Asp Val Glu Lys Ile Glu Gly Asp Met Thr Val Gln Cys His Tyr Tyr
100 105 110

Thr His Tyr Ser Ser Ala Ile Ile Val Ala Ser Tyr Leu Val Arg Met
115 120 125

Pro Pro Phe Thr Gln Ala Phe Cys Ser Leu Gln Gly Gly Ser Phe Asp
130 135 140

Val Ala Asp Arg Met Phe His Ser Val Lys Ser Thr Trp Glu Ser Ala
145 150 155 160

Ser Lys Glu Asn Met Ser Asp Val Arg Glu Leu Thr Pro Glu Phe Phe
165 170 175

Tyr Leu Pro Glu Phe Leu Thr Asn Cys Asn Ala Val Glu Phe Gly Cys
180 185 190

Met Gln Asp Gly Thr Thr Leu Gly Asp Val Gln Leu Pro Pro Trp Ala
195 200 205

Asp Gly Asp Pro Arg Lys Phe Ile Ser Leu His Arg Gln Ala Leu Glu
210 215 220

Ser Asp Phe Val Ser Ser Asn Leu His His Trp Ile Asp Leu Ile Phe
225 230 235 240

Gly Tyr Lys Gln Gln Gly Pro Ala Ala Val Glu Ala Val Asn Thr Phe
245 250 255

His Pro Tyr Phe Tyr Gly Asp Arg Ile Asp Leu Gly Ser Ile Thr Asp
260 265 270

Pro Leu Ile Lys Ser Thr Ile Leu Gly Phe Ile Ser Asn Phe Gly Gln
275 280 285

Val Pro Lys Gln Ile Phe Thr Lys Pro His Pro Ser Arg Asn Thr Thr
290 295 300

Gly Lys Asn Pro Gly Pro Gly Lys Asp Ala Ser Thr Pro Val Gly Leu
305 310 315 320

Pro Gly His Ser Gln Ser Phe Leu His Ser Leu Pro Ala Leu Arg Pro
325 330 335

Ser Gln Val Thr Val Lys Asp Met Tyr Leu Phe Ser Leu Gly Ser Glu
340 345 350

Ser Pro Lys Gly Ala Ile Gly His Ile Val Pro Thr Glu Lys Ser Ile
355 360 365

Leu Ala Val Glu Lys Asn Lys Leu Leu Met Pro Pro Leu Trp Asn Arg
370 375 380

Thr Phe Ser Trp Gly Phe Asp Asp Phe Ser Cys Cys Leu Gly Ser Tyr
385 390 395 400

Gly Ser Asp Lys Ile Leu Met Thr Phe Glu Asn Leu Ala Ala Trp Gly
405 410 415

Pro Cys Leu Cys Ala Val Cys Pro Ser Pro Thr Met Ile Val Thr Ser
420 425 430

Gly Ala Ser Ala Val Val Cys Ile Trp Glu Leu Ser Leu Val Lys Gly
435 440 445

Arg Pro Arg Gly Leu Lys Leu Arg Gln Ala Leu Tyr Gly His Thr Gln
450 455 460

Ala Val Thr Cys Leu Thr Ala Ser Val Thr Phe Ser Leu Leu Val Ser
465 470 475 480

Gly Ser Gln Asp Arg Thr Cys Ile Leu Trp Asp Leu Asp His Leu Ser
485 490 495

Arg Val Ala Cys Leu Pro Val His Arg Glu Gly Ile Ser Ala Ile Ala
500 505 510

Ile Ser Asp Val Ser Gly Thr Ile Val Ser Cys Ala Gly Ala His Leu
515 520 525

Ser Leu Trp Asn Val Asn Gly Gln Pro Leu Ala Ser Ile Thr Thr Ala
530 535 540

Trp Gly Pro Glu Gly Thr Ile Thr Cys Cys Cys Ile Val Glu Gly Pro
545 550 555 560

Ala Trp Asp Ala Ser His Val Ile Ile Thr Gly Ser Lys Asp Gly Met
565 570 575

Val Arg Ile Trp Lys Thr Glu Asp Val Lys Met Pro Val Pro Arg Gln
580 585 590

Ala Val Met Glu Glu Pro Ser Thr Glu Pro Leu Ser Pro Arg Gly His
595 600 605

Lys Trp Ala Lys Asn Leu Ala Leu Ser Arg Glu Leu Asp Val Ser Val
610 615 620

Ala Leu Ser Gly Lys Pro Ser Lys Ala Ser Pro Ala Val Thr Ala Leu
625 630 635 640

Ala Ile Thr Arg Asn Gln Ser Lys Leu Leu Val Gly Asp Glu Lys Gly
645 650 655

Glu Ser Ser Ala Gly Leu Leu Met Gly Arg Arg Gln Gly Ala Gly Gly
660 665 670

Thr Asp Leu Lys Ala Trp Ser Pro Gly Val Gly Ser Asn Arg Leu Gln
675 680 685

Ala Gln Asp Asp Val
690



<210> 49 
<211> 2363 
<212> DNA 

<213> Mus musculus 
<400> 49 

cagaggcagg caggacaggc actgcasgca ctgagagcct tgcagtcaga agggaagcac 60
asacaggaag aggcagtgag gagaaaagga gcasaactga gacaagcaat ggagatccct 120
gaggagtgct ggccaacggc tgaagtgtcc ctaaaagaca tcactgaaat aatggagaga 180
catctcagtc acatggaacg gaccctgtct cacagtcaaa agctctcaga tggacacctg 240
gtaagatggg catctggagg gctggtcctg cagggaattt ataagaccaa ccacccaaga 300
agcctgattc agaagag t . agagctactc agtgtcccta agcagttctc ac gctggc 360
ccagtacatg gcacagagat aaaaacaaag gaattttcat cttttcaaga acaagccatg 420 
tttacacaga ctatagagag ggtgggtttc agttcaaccc ccttagttaa aggagaaggt 480 
tggggactta gtctagaagc tggtatgggt cacaacaaac agacagaatc tgaagataac 540 
taccagtccc attcaaagca aacttatttt tgctcagcca ggttcagcta catcccattg 600 
gccacctgcc atrttcacat caatgatctt gaactctccc aggctgctct ccaggaacta 660 
aaaagtattg aagaaatcct ggagcagact acacaccacc gagatggact acccttactg 720 
aggcacaggg ctaaaaactt tttccacagg tttggctctc atgctaacca aggccctgtg 780 
cacctggggg gaa-ctactg ctggaaagcc atttcagaag gtttcaaaag tgagcacttg 840 
gctgatgtaa agcagcaagc agaagagtct ttgaatattt acattatggg cagttatagt 900 
ggctttggag ttaaagttgg tgcgagtgta aatataacaa attcaaaatc aaaaacagca 960 
ttttacagta aaactcatct aaactcgcaa accaaggtac aactatctgt agccaagata 1020 
ggtggaccag cagaagcaga tggaattgcc cagtggacag ctggccttgt agctagcaat 1080 
caaacctggt ctgrtattga taggaaactg cagttggtac ctatttggga cattatcctg 1140 
tccagtcaca gaactgaatt taagaatgct cttcaactgg ctaactgcct caaagaccac 1200 
tacactgctc tgactgaact agctgcccag attcaagaag gggaagaatt tctgactgct 1260 
agaaaagaag ctaagctttt cctaaagaat gtgaaaggct gggaggtttc tgatcctgaa 1320 
gaacagctta ggaagttagt agattttatg caaacattga gtcaaaaaat aaaaagttat 1380 
gacatttgga tcaacacatg cctcatagat tgggatctgc aaaattttct aataaacatt 1440 
gtcaacttct gcaaaaattc acccacttat aaaactcact ttattaaatc tcagttgtgc 1500 
agccttctag aacctcatgt ctacaaagtg acaaactttc ctcaggcaca atccatcata 1560 
cagtggatca atcagtcaga gtcagaggaa gaactagtca aaattacctc attttctgaa 1620 
ttcattaaca ccttaaagaa aacccacaaa tacctaatgg aagagagttt caaaactgag 1680 
cccccagaaa gagzggaaga agcaaagaga atggctacat atgaagtcac cacagctctc 1740 
agctccttct tgaagtacct cagagaaaca cagcagccag acatgcagct gttgctactc 1800 
tccattgcta ctggtgtagg ctatcagttg gtaaacagta tttttcagca tcttctgggg 1860 
tgtgatgagt taaacttcct cttggatcaa atggaaaata acgaacataa ataccaagaa 1920 
ctgaaaaata tttgcaatta cagagcccag gcattcttgg tgctcacagc cctaagagcc 1980 
acagttgaaa tcacagatgt ttctacagaa gagaaaggac aacgtttgac attaatacaa 2040 
caacatatgg ggtcactgtt gtctgaagaa gttgcacatg tcctcacaaa acatggagaa 2100 
catcatgact gggaaaggct ggagaatgat ttgagattac tcattgaggg ggactataaa 2160 
gccaccaccc attccttaca aatggatgaa gtaaaaaaac aattgcaaag tttntgccat 2220 
gaaaagaaac agacttataa acaacaaggt aatgaaaaca gaacaaaaga aatgatagaa 2280 
aatggacatt tcc-ggactt actccaacgt ttaggcctag acaattacta tccaaaaaaa 2340 
aaaaaaaaaa aaaaaaaaaa aaa 2363 

<210> 50 
<211> 778 
<212> PRT 

<213> Mus musculus 
<400> 50 

Gln Arg Gln Ala Gly Gln Ala Leu Xaa Ala Leu Arg Ala Leu Gln Ser
1 5 10 15

Glu Gly Lys His Xaa Gln Glu Glu Ala Val Arg Arg Lys Gly Ala Xaa
20 25 30

Leu Arg Gln Ala Met Glu Ile Pro Glu Glu Cys Trp Pro Thr Ala Glu
35 40 45

Val Ser Leu Lys Asp Ile Thr Glu Ile Met Glu Arg His Leu Ser His
50 55 60

Met Glu Arg Thr Leu Ser His Ser Gln Lys Leu Ser Asp Gly Asp Leu
65 70 75 80

Val Arg Trp Ala Ser Gly Gly Leu Val Leu Gln Gly Ile Tyr Lys Thr
85 90 95

Asn His Pro Arg Ser Leu Ile Gln Lys Arg Glu Glu Leu Leu Ser Val
100 105 110

Pro Lys Gln Phe Ser Leu Ala Gly Pro Val His Gly Thr Glu Ile Lys
115 120 125

Thr Lys Glu Phe Ser Ser Phe Gln Glu Gln Ala Met Phe Thr Gln Thr
130 135 140

Ile Glu Arg Val Gly Phe Ser Ser Thr Pro Leu Val Lys Gly Glu Gly
145 150 155 160

Trp Gly Leu Ser Leu Glu Ala Gly Met Gly His Asn Lys Gln Thr Glu
165 170 175

Ser Glu Asp Asn Tyr Gln Ser His Ser Lys Gln Thr Tyr Phe Cys Ser
180 185 190

Ala Arg Phe Ser Tyr Ile Pro Leu Ala Thr Cys His Phe His Ile Asn
195 200 205

Asp Leu Glu Leu Ser Gln Ala Ala Leu Gln Glu Leu Lys Ser Ile Glu
210 215 220

Glu Ile Leu Glu Gln Thr Thr His His Arg Asp Gly Leu Pro Leu Leu
225 230 235 240

Arg His Arg Ala Lys Asn Phe Phe His Arg Phe Gly Ser His Ala Asn
245 250 255

Gln Gly Pro Val His Leu Gly Gly Ile Tyr Cys Trp Lys Ala Ile Ser
260 265 270

Glu Gly Phe Lys Ser Glu His Leu Ala Asp Val Lys Gln Gln Ala Glu
275 280 285

Glu Ser Leu Asn Ile Tyr Ile Met Gly Ser Tyr Ser Gly Phe Gly Val
290 295 300

Lys Val Gly Ala Ser Val Asn Ile Thr Asn Ser Lys Ser Lys Thr Ala
305 310 315 320

Phe Tyr Ser Lys Thr His Leu Asn Ser Gln Thr Lys Val Gln Leu Ser
325 330 335

Val Ala Lys Ile Gly Gly Pro Ala Glu Ala Asp Gly Ile Ala Gln Trp
340 345 350

Thr Ala Gly Leu Val Ala Ser Asn Gln Thr Trp Ser Val Ile Asp Arg
355 360 365

Lys Leu Gln Leu Val Pro Ile Trp Asp Ile Ile Leu Ser Ser His Arg
370 375 380

Thr Glu Phe Lys Asn Ala Leu Gln Leu Ala Asn Cys Leu Lys Asp His
385 390 395 400

Tyr Thr Ala Leu Thr Glu Leu Ala Ala Gln Ile Gln Glu Gly Glu Glu
405 410 415

Phe Leu Thr Ala Arg Lys Glu Ala Lys Leu Phe Leu Lys Asn Val Lys
420 425 430

Gly Trp Glu Val Ser Asp Pro Glu Glu Gln Leu Arg Lys Leu Val Asp
435 440 445

Phe Met Gln Thr Leu Ser Gln Lys Ile Lys Ser Tyr Asp Ile Trp Ile
450 455 460

Asn Thr Cys Leu Ile Asp Trp Asp Leu Gln Asn Phe Leu Ile Asn Ile
465 470 475 480

Val Asn Phe Cys Lys Asn Ser Pro Thr Tyr Lys Thr His Phe Ile Lys
485 490 495

Ser Gln Leu Cys Ser Leu Leu Glu Pro His Val Tyr Lys Val Thr Asn
500 505 510

Phe Pro Gln Ala Gln Ser Ile Ile Gln Trp Ile Asn Gln Ser Glu Ser
515 520 525

Glu Glu Glu Leu Val Lys Ile Thr Ser Phe Ser Glu Phe Ile Asn Thr
530 535 540

Leu Lys Lys Thr His Lys Tyr Leu Met Glu Glu Ser Phe Lys Thr Glu
545 550 555 560

Pro Pro Glu Arg Val Glu Glu Ala Lys Arg Met Ala Thr Tyr Glu Val
565 570 575

Thr Thr Ala Leu Ser Ser Phe Leu Lys Tyr Leu Arg Glu Thr Gln Gln
580 585 590

Pro Asp Met Gln Leu Leu Leu Leu Ser Ile Ala Thr Gly Val Gly Tyr
595 600 605

Gln Leu Val Asn Ser Ile Phe Gln His Leu Leu Gly Cys Asp Glu Leu
610 615 620

Asn Phe Leu Leu Asp Gln Met Glu Asn Asn Glu His Lys Tyr Gln Glu
625 630 635 640

Leu Lys Asn Ile Cys Asn Tyr Arg Ala Gln Ala Phe Leu Val Leu Thr
645 650 655

Ala Leu Arg Ala Thr Val Glu Ile Thr Asp Val Ser Thr Glu Glu Lys
660 665 670

Gly Gln Arg Leu Thr Leu Ile Gln Gln His Met Gly Ser Leu Leu Ser
675 680 685

Glu Glu Val Ala His Val Leu Thr Lys His Gly Glu His His Asp Trp
690 695 700

Glu Arg Leu Glu Asn Asp Leu Arg Leu Leu Ile Glu Gly Asp Tyr Lys
705 710 715 720

Ala Thr Thr His Ser Leu Gln Met Asp Glu Val Lys Lys Gln Leu Gln
725 730 735

Ser Xaa Cys His Glu Lys Lys Gln Thr Tyr Lys Gln Gln Gly Asn Glu
740 745 750

Asn Arg Thr Lys Glu Met Ile Glu Asn Gly His Phe Leu Asp Leu Leu
755 760 765

Gln Arg Leu Gly Leu Asp Asn Tyr Tyr Pro
770 775



<210> 51 
<211> 1712 
<212> DNA

<213> Mus musculus 
<400> 51 

tcttcgaaag ccgggctgag gggaatcctg gacaggggaa tcctggacgt ggagatcgtg 60 
agtcatggct gcttcccgag acgctgatga gatccacaag gacgttcaga actactatgg 120 
gaatgtactg aagacatctg cagacctcca gactaatgct tgtgtcacgc gagccaagcc 180 
ggtccccagc tacatccggg aaagtctgca gaatgtacac gaagacgtta gttcgaggta 240 
ttatggctgt ggtctgactg ttcctgagcg gctggaaaac tgccgaattt tggatctggg 300 
tagtgggagt ggcagggatt gctatgtgct tagccagctg gttggtgaga agggacatgt 360 
caccggaata gacatgactg aggtccaggt cgaagtggct aaaacctatc ttgaacacca 420 
catggaaaaa tttggtttcc aggcacccaa tgtgactttt ctccacggcc gcatcgagaa 480
gttggcagag gctgggatcc agagtgagag ctatggtatt gtcatatcca actgtgttat 540 
caaccttgtt cctgataaac aacaagtcct ccaggaggtc tatcgagtgc tgaagcacgg 600 
cggggagctc tatttcagtg acgtctatgc cagccttgaa gtgccagaag acatcaagtc 660 
gcacaaagtt ttatgggggg aatgcctggg aggcgctctg tactggaagg atcttgccat 720 
cattgcccaa aagattgggt tctgccctcc acgtttggtc actgccgata tcattactgt 780 
tgaaaacaag gagctcgaag gggttcttgg tgactgtcgc tttgtgtctg ccacatttcg 840
cctcttcaaa ctccctaaga cagagccagc cgaaagatgc cgagttgttt acaatggagg 900 
aatcaaggga catgaaaagg aactaatttt cgatgcaaat ttcacattca aggaaggcga 960 
agctgttgca gtggatgagg agacggcagc tgtcctgaag aactcacgtt ttgctccgga 1020 
ttttctcttc acacctgttg acgcctcgct gccagctccc caggggccgt tctgagttag 1080 
agacaaaggt tctaatcaga gatccattca agcttgcaga ggactctgac aagatgaagc 1140 
ccagacatgc acctgaaggc acgggaggct gctgtggcaa gaggaaaaac tgctagatct 1200 
acagccagcg cggagcccac cgggctcaag agggtggcta aaggacagtc acagaggctt 1260 
cttagcctgc tcttcgccag tgcacagatt atgtgaaggt ggcaaagcca ccacaagcta 1320 
gaccactgct aagaataaga gtgactttta gaggatgtta attgaaggtt cacagcaaat 1380 
cgcctgcttt tctatttctc tatctcagag ttctggtgcc acctagtggt cagaagtaga 1440 
acttggaagc ccaaggttta ctcaaagggc caaaggcatc atcaacgttg tgagaattat 1500 
cttccttctg gcctaccaca ggacacctct gggttcttct ctgtggttac caggaagcac 1560 
agtacttact aaatttatgc taaccatgac aaaagattgt caactcaaat ttgctaggag 1620 
tattctttag gttgctgtct gcaatttttt tctctgtaac tgaatgaaaa agaaaacaaa 1680 
taaaaaataa atttgacttc gaaaaaaaaa aa 1712 

<210> 52 
<211> 336 
<212> PRT 

<213> Mus musculus 
<400> 52 

Met Ala Ala Ser Arg Asp Ala Asp Glu Ile His Lys Asp Val Gln Asn
1 5 10 15

Tyr Tyr Gly Asn Val Leu Lys Thr Ser Ala Asp Leu Gln Thr Asn Ala
20 25 30

Cys Val Thr Arg Ala Lys Pro Val Pro Ser Tyr Ile Arg Glu Ser Leu
35 40 45

Gln Asn Val His Glu Asp Val Ser Ser Arg Tyr Tyr Gly Cys Gly Leu
50 55 60

Thr Val Pro Glu Arg Leu Glu Asn Cys Arg Ile Leu Asp Leu Gly Ser
65 70 75 80

Gly Ser Gly Arg Asp Cys Tyr Val Leu Ser Gln Leu Val Gly Glu Lys
85 90 95

Gly His Val Thr Gly Ile Asp Met Thr Glu Val Gln Val Glu Val Ala
100 105 110

Lys Thr Tyr Leu Glu His His Met Glu Lys Phe Gly Phe Gln Ala Pro
115 120 125

Asn Val Thr Phe Leu His Gly Arg Ile Glu Lys Leu Ala Glu Ala Gly
130 135 140

Ile Gln Ser Glu Ser Tyr Gly Ile Val Ile Ser Asn Cys Val Ile Asn
145 150 155 160

Leu Val Pro Asp Lys Gln Gln Val Leu Gln Glu Val Tyr Arg Val Leu
165 170 175

Lys His Gly Gly Glu Leu Tyr Phe Ser Asp Val Tyr Ala Ser Leu Glu
180 185 190

Val Pro Glu Asp Ile Lys Ser His Lys Val Leu Trp Gly Glu Cys Leu
195 200 205

Gly Gly Ala Leu Tyr Trp Lys Asp Leu Ala Ile Ile Ala Gln Lys Ile
210 215 220

Gly Phe Cys Pro Pro Arg Leu Val Thr Ala Asp Ile Ile Thr Val Glu
225 230 235 240

Asn Lys Glu Leu Glu Gly Val Leu Gly Asp Cys Arg Phe Val Ser Ala
245 250 255

Thr Phe Arg Leu Phe Lys Leu Pro Lys Thr Glu Pro Ala Glu Arg Cys
260 265 270

Arg Val Val Tyr Asn Gly Gly Ile Lys Gly His Glu Lys Glu Leu Ile
275 280 285

Phe Asp Ala Asn Phe Thr Phe Lys Glu Gly Glu Ala Val Ala Val Asp
290 295 300

Glu Glu Thr Ala Ala Val Leu Lys Asn Ser Arg Phe Ala Pro Asp Phe
305 310 315 320

Leu Phe Thr Pro Val Asp Ala Ser Leu Pro Ala Pro Gln Gly Pro Phe
325 330 335



<210> 53 

<211> 3599 

<212> DNA 

<213> Mus musculus 

<400> 53 

agagacagcg tgatcccggc ctcccacggg gcagctttta ctgtctaggg aagaaatccc 60 
caaagtccat ggagtctgaa gactctgtca agcctcgcta ggaaacctag gagttttaga 120 
gggcacttgg caccggaagc tagccgggta ggcggagcct cacctggatt gagttcacag 180 
ctgcctagac aggctcagac taggtgctgg gcacctggga ggaggaggag acattagcag 240 
caaaggctgt taacagaagt gcctgcctag gcttggaggc aagacgctgc tgttcacagt 300 
gcgagacgga ggtaggagta taatggctgt ccaggtgctg cggcagatgg tctacttcct 360 
actgagtctg ttttctctgg tgcaaggtgc acacagtggc agcccccgag aagacttccg 420 
cttctgtggc cagcggaacc agacccaaca gagcaccctc cactatgatc aatcttcaga 480 
gcctcacatc tttgtgtgga acacagagga gaccctcaca attcgtgccc ccttcctggc 540 
agccccagat attccccgct tcttcccaga gcctagaggg ctctatcact tctgcctcta 600 
ctggagtcgc cacactggga gactccactt gcgctatggc aagcatgact acctgcttag 660 
tagccaagcc tccagactcc tctgcttcca gaaacaggag cagagcctga agcagggagc 720 
cccgctgatc gccacctctg tcagctcctg gcagattccc cagaacacca gcctgcctgg 780 
ggctccgagc ttcatcttct ccttccacaa tgccccacac aaggtctccc acaatgcatc 840 
tgtggacatg tgrgacctca agaaggaatt gcagcagctt agcaggtacc tgcagcaccc 900 
tcaaaaggct gccaagcc ccaccgcagc gttcatcagc cagcagttac ac, ^ctgga 960
gtcaaagctg acctctgtga gcttcctggg agacacatta tcctttgagg aggaccgggt 1020 
caatgctaca gtgtggaagc tgccacccac agccggtcta gaggatctgc atatccactc 1080 
ccagaaggag gaggagcaga gtgaggtcca ggcatactcg ctgttgcttc cccgggccgt 1140 
attccagcag accagaggcc gtcgccggga tgacgccaag aggctcctgg tagtagactt 1200 
cagcagccaa gctttgttcc aggacaagaa ttctagccaa gtcctgggtg agaaggtctt 1260 
gggtattgtc gtgcagaaca ccaaagtcac caacctctca gatccggtgg tactcacctt 1320 
ccagcaccag cctcagccaa aaaatgtgac tctgcagtgc gtgttctggg ttgaagaccc 1380 
ggcatcaagc agcacaggga gctggagcag tgcaggctgc gagacagtga gcagagacac 1440 
acagacatcc tgcctgtgca accacctgac ctactttgca gtgctgatgg tgtcatccac 1500 
agaggtagaa gccactcaca aacactacct cacgctcctg tcctacgtgg gctgtgtcat 1560 
ctctgctctg gcttgtgtct tcactatcgc tgcctacctc tgctccagga ggaagtcacg 1620 
tgactacacc atcaaagtcc acatgaacct gctgtccgct gtcttcctgc tggacgtgag 1680 
cttcctgctc agcgagcctg tggcactgac gggctccgaa gcagcctgtc gcaccagtgc 1740 
catgttcctg cacttctccc tgcttgcctg cctctcctgg atgggcctcg agggctacaa 1800 
tctctaccga ctggtggtgg aggtcttcgg tacctatgtg cccggctatc tgctcaagct 1860 
gagcatcgtg ggctggggtt ttcctgtctt cctggtcact ctggcggcgt tggtggatgt 1920 
gaataactac ggccccatta tcctagctgt gcgccggact ccggaacgtg tcacctaccc 1980 
ctctatgtgc tggatccggg actccctggt gagctatgtc accaacctgg gcctcttcag 2040 
tctggtgttc ctgttcaacc tggctatgct ggccaccatg gtggtgcaga tcctgcggct 2100 
tcgcccgcac agccagaact ggccccacgt gctgaccctg ctgggcctca gcctggtcct 2160 
tggcctcccc tgggccttgg tcttcttttc ctttgcttcc ggcaccttcc agcttgtcat 2220 
cctctacctc ttcagcatca taacttccta ccaaggcttc ctcatcttcc tgtggtactg 2280 
gtccatgcgg ttccaggccc aaggcggccc ctcccctctg aagaacaact cagacagcgc 2340 
caaactcccc atcagctccg gcagcacctc ctccagccgc atctaagcca ccgccacacc 2400 
tcccctccgg gaggacacat gcatggcgtc cgctcacgat gtctgtggcc cagtgctgtg 2460
cccacccagc ctttgttggt tagtggcata ctagagaagg ccctggtcct tgaaggcgta 2520 
gggctgttgc tctggtaggt agatacctag cttgccttgg ggacgactct ggtcctcaaa 2580 
gggctcagaa gcacactgcc attctgtttg tggggccgtt tcagtctgga gctaaggcct 2640 
tgtctttctg gccacctctg ggtccagctg ttgctgctgg gtgttgagac ctgcagaccc 2700 
aagctggggt tagatctcga aggaggctga cacatccggc ctgagacaca gctaactgtc 2760 
ttgacttgct gctctgtctc tgtggtcacc atgcagatcc cgagggtggc actgggggta 2820 
aatgttctgg gagaaggttt ggaggcagag caccttagga gctgagcatc tcccccagcc 2880 
tttctgcaaa ccctcctctt cattccccat ccccaacccc tcctctgtgt tcccctaacc 2940 
ctccacctga agcctggggt cctagaccaa tgctgtgatt tggggtggta gttcccagca 3000 
gtttcctggt gccagctatc aacttctgtc tgttgtgtgg gcttiiggcct ctgactcagg 3060 
gcaggtttct gtctgagccc tctctccaag ctgcctcacc tttgctcgca cctcagaggg 3120 
acctccatct ctcctgaagc ctcctccctc tggcaagtac tgggaracag ccaccctttc 3180 
aacccagcac tctgaagacc aagacagccc cctctggtga cactggccaa gcttgatctt 3240 
tttcctaaga agtggtcttc agatccccgc aggtcgctca gaagacactg ggctgcctag 3300 
tgtgaattct gtcctactaa cgtacagtga gcagctcctc acccccaccc ccgcaaaagc 3360 
tctcaccaag tcctggagtg tcaggcaggg ggctggaaat ccaggaggac ttcctgcaaa 3420 
aggcagcatt tcatcttgac ctcagccttc aggttgggga gaatgttctt tttaaatacc 3480 
agttcatttg tcttttgata ttaaagctct ttatagagag tctggaaact gtaggcgatt 3540 
gtcgagaaga gaaataaaaa tgagctgtta tctaatgcca tggcaaagca gcacaaaaa 3599 

<210> 54 
<211> 687 
<212> PRT 

<213> Mus musculus 
<400> 54 

Met Ala Val Gln Val Leu Arg Gln Met Val Tyr Phe Leu Leu Ser Leu
1 5 10 15

Phe Ser Leu Val Gln Gly Ala His Ser Gly Ser Pro Arg Glu Asp Phe
20 25 30

Arg Phe Cys Gly Gln Arg Asn Gln Thr Gln Gln Ser Thr Leu His Tyr
35 40 45

Asp Gln Ser Ser Glu Pro His Ile Phe Val Trp Asn Thr Glu Glu Thr
50 55 60

Leu Thr Ile Arg Ala Pro Phe Leu Ala Ala Pro Asp Ile Pro Arg Phe
65 70 75 80

Phe Pro Glu Pro Arg Gly Leu Tyr His Phe Cys Leu Tyr Trp Ser Arg
85 90 95

His Thr Gly Arg Leu His Leu Arg Tyr Gly Lys His Asp Tyr Leu Leu
100 105 110

Ser Ser Gln Ala Ser Arg Leu Leu Cys Phe Gln Lys Gln Glu Gln Ser
115 120 125

Leu Lys Gln Gly Ala Pro Leu Ile Ala Thr Ser Val Ser Ser Trp Gln
130 135 140

Ile Pro Gln Asn Thr Ser Leu Pro Gly Ala Pro Ser Phe Ile Phe Ser
145 150 155 160

Phe His Asn Ala Pro His Lys Val Ser His Asn Ala Ser Val Asp Met
165 170 175

Cys Asp Leu Lys Lys Glu Leu Gln Gln Leu Ser Arg Tyr Leu Gln His
180 185 190

Pro Gln Lys Ala Ala Lys Arg Pro Thr Ala Ala Phe Ile Ser Gln Gln
195 200 205

Leu Gln Ser Leu Glu Ser Lys Leu Thr Ser Val Ser Phe Leu Gly Asp
210 215 220

Thr Leu Ser Phe Glu Glu Asp Arg Val Asn Ala Thr Val Trp Lys Leu
225 230 235 240

Pro Pro Thr Ala Gly Leu Glu Asp Leu His Ile His Ser Gln Lys Glu
245 250 255

Glu Glu Gln Ser Glu Val Gln Ala Tyr Ser Leu Leu Leu Pro Arg Ala
260 265 270

Val Phe Gln Gln Thr Arg Gly Arg Arg Arg Asp Asp Ala Lys Arg Leu
275 280 285

Leu Val Val Asp Phe Ser Ser Gln Ala Leu Phe Gln Asp Lys Asn Ser
290 295 300

Ser Gln Val Leu Gly Glu Lys Val Leu Gly Ile Val Val Gln Asn Thr
305 310 315 320

Lys Val Thr Asn Leu Ser Asp Pro Val Val Leu Thr Phe Gln His Gln
325 330 335

Pro Gln Pro Lys Asn Val Thr Leu Gln Cys Val Phe Trp Val Glu Asp
340 345 350

Pro Ala Ser Ser Ser Thr Gly Ser Trp Ser Ser Ala Gly Cys Glu Thr
355 360 365

Val Ser Arg Asp Thr Gln Thr Ser Cys Leu Cys Asn His Leu Thr Tyr
370 375 380

Phe Ala Val Leu Met Val Ser Ser Thr Glu Val Glu Ala Thr His Lys
385 390 395 400



His Tyr Leu Thr Leu Leu Ser Tyr Val Gly Cys Val Ile Ser Ala Leu
405 410 415

Ala Cys Val Phe Thr Ile Ala Ala Tyr Leu Cys Ser Arg Arg Lys Ser
420 425 430

Arg Asp Tyr Thr Ile Lys Val His Met Asn Leu Leu Ser Ala Val Phe
435 440 445

Leu Leu Asp Val Ser Phe Leu Leu Ser Glu Pro Val Ala Leu Thr Gly
450 455 460

Ser Glu Ala Ala Cys Arg Thr Ser Ala Met Phe Leu His Phe Ser Leu
465 470 475 480

Leu Ala Cys Leu Ser Trp Met Gly Leu Glu Gly Tyr Asn Leu Tyr Arg
485 490 495

Leu Val Val Glu Val Phe Gly Thr Tyr Val Pro Gly Tyr Leu Leu Lys
500 505 510

Leu Ser Ile Val Gly Trp Gly Phe Pro Val Phe Leu Val Thr Leu Val
515 520 525

Ala Leu Val Asp Val Asn Asn Tyr Gly Pro Ile Ile Leu Ala Val Arg
530 535 540

Arg Thr Pro Glu Arg Val Thr Tyr Pro Ser Met Cys Trp Ile Arg Asp
545 550 555 560

Ser Leu Val Ser Tyr Val Thr Asn Leu Gly Leu Phe Ser Leu Val Phe
565 570 575

Leu Phe Asn Leu Ala Met Leu Ala Thr Met Val Val Gln Ile Leu Arg
580 585 590

Leu Arg Pro His Ser Gln Asn Trp Pro His Val Leu Thr Leu Leu Gly
595 600 605

Leu Ser Leu Val Leu Gly Leu Pro Trp Ala Leu Val Phe Phe Ser Phe
610 615 620

Ala Ser Gly Thr Phe Gln Leu Val Ile Leu Tyr Leu Phe Ser Ile Ile
625 630 635 640

Thr Ser Tyr Gln Gly Phe Leu Ile Phe Leu Trp Tyr Trp Ser Met Arg
645 650 655

Phe Gln Ala Gln Gly Gly Pro Ser Pro Leu Lys Asn Asn Ser Asp Ser
660 665 670

Ala Lys Leu Pro Ile Ser Ser Gly Ser Thr Ser Ser Ser Arg Ile
675 680 685



<210> 55 
<211> 3590 
<212> DNA 

<213> Mus musculus
<400> 55 

ctcagggaag gccgatcttc cgggtggagg gggaagcggc gtgactggag tggaaatttt 60
tcccaacaca acr_=tcgga ggcaacatat tggaagggac tcggggaggc cggagtccaa 120
atggaagtgg ctgaaagaaa cttctcgccc tgctgattct gagccccgcg tcgtgccgcg 180
cgccctcaat taccucatcg acctgtggtc ttgacagaac attcttcaca atccaaaaga 240
aaaagcagac tggttcg; tgtttgacgg ctatgatagc tgcagtgagg ac Aagtag 300
cagctccagc tctgaggaga gtgaagaaga agttgctcc- ttaccttcca atctcccaat 360 
catcaagaat aatggacaag tctacacata cccagatggt aaatctggca tggctacctg 420 
tgagatgtgt gggatggtcg gtgtgcgaga tgctttttac tctaaaacga aacgtttctg 480 
cagtgtttcc tgttcaagaa gttactcgtc aaactctaag aaggcaagca ttctggccag 540 
acttcagggt aagcctccaa caaagaaagc aaaagtcctt caaaaacaac ctttagttgc 600 
taaattagct gcctatgccc agtatcaagc taccttgcaa aatcaagcaa agacaaaagc 660 
aggcaattct gcaatctctg tggaaggctt cagctggggr aactacatca acagcaacag 720 
cttcatagca gctccagtgg cctgttttaa gcatgcacct atggggacct gctggggtga 780
tatctcggaa aatgtaagga tagaagttcc caatacagac tgcagtctac ctaccaaagt 840
cttctggatt gctggaatta taaaattagc aggttataat gcccttttga gatatgaagg 900 
atttgaaaat gattcttctc tggacttctg gtgcaatata tgtgggtctg atattcatcc 960 
agttggttgg tgtgcagcta gtggaaaacc tctcgttccr cctagaactg ttcaacataa 1020 
atatacaaac tggaaagctt ttctagtaaa aagacttact ggtgccaaaa cacttcctcc 1080 
tgatttctca caaaaggttt ctgagagtat gcaatatcct ttcaaacctt gcatgagagt 1140 
agaagtagtt gacaagaggc atttatgtcg aacaagagta gcagtggtgg aaagtgtaat 1200 
tggaggacga ctacggctgg tgtatgaaga gagtgaagat ggaacagacg acttctggtg 1260 
ccacatgcac agccccttaa tccaccatat tggatggtca agaagcatag gccatcgatt 1320 
caagagatca gatattacga agaaacagga cggacatttc gatacacctc cacacttatt 1380 
tgctaaggta aaagaagtag accagagtgg agaatggttc aaagaaggaa tgaaattgga 1440
agctatagac ccattaaatc tttctacaat atgtgttgcc accattagaa aggtgctggc 1500
tgatggattc ctgatgattg ggattgatgg ctcagaagca gcagatggat ctgactggtt 1560 
ctgttatcat gcaacctctc cttccatttt ccctgtgggr ttctgtgaaa ttaacatgat 1620 
agaactgact ccacccagag gttacacaaa acttcctttt aaatggtttg actacctcag 1680 
ggaaaccggc tccattgcag caccagtaaa actatttaar aaggatgttc caaaccacgg 1740 
attccggtta ggaatgaaat tagaagctgt agatctcatg gagccacggt taatatgtgt 1800 
agccacagtt actcgaatta ttcaccatct cttgaggata cattttgatg gttgggaaga 1860 
agagtatgac cagtgggtag actgtgagtc ccctgacctc tatcctgtag ggtggtgtca 1920 
gttaactgga tatcaactac agcctccagc atcacagtca tcaagagaaa gccaatcagc 1980 
ttcttcaaaa cagaagaaaa aggctaagtc tcagcaatac aaaggacata agaaaatgac 2040 
cacgtcacag ctgaaggagg agctgctgga cggggaggac tatagcttcc tccatggagc 2100 
atctgaccag gaaagcaatg gctctgccac cgtctacatc aaacaagagc catgaggcga 2160 
ctcggaaacc atgggcaggc ggggctgttt acaggactga tttggaatca gccagctgta 2220 
tagcgggcta ttctactggg acattttgct aaacacagaa aaaagttcag ttccagattt 2280 
ttcaggtggg ggggaaacta ttttggtggg ggggcaattt ttcaatttat aaagacggac 2340 
aatttttgtg ttgtatttga agcttttgaa agaattttgt aatattttcc aagtttggat 2400 
ttatgtgcat tgttaacaag aactgaaatt ataacttttt tggtaagata aaagtttagg 2460 
tagcaggatt gaaggaaatg attaagaagg atatagttgt aaatgcacat gaactgtcat 2520 
tacaaatgaa ccttcttggt acctgttggg agattttttg gattttagag ttaggccagt 2580 
cacatctcca gcttcctttg ctgcgaaaat atgcaactga acccctcaca gagggctcat 2640 
caccatgtag atcacggaag gggtcattaa ttgtgctcgc tgacgtttta ttgcagccca 2700 
tttaactgtt tgtacagaaa ctttttcatt ctgctaaaac ttatttggag ttgtatatga 2760 
aactaggaga actctggata cttttatatg ctctccttca cgtaaagatg attaaaattg 2820 
tctaacacta aagtgttaaa ctggaatggt tgacaaagta cacaagatct cagtctacat 2880 
aaagggttgg gggaaattag tattttccta agtttattcc gttttccttg ctagaaaaat 2940 
cccataaaag ggatattctt gtcagacctc tgccatttcc tccatctgtg agagagcaac 3000 
gtgatgtaac cacacctgag agcaggtact gtcctgtgtt acaaaagaat cacatggaaa 3060 
attgttaacc aaaataagtg tagatttcag aatcattcca gctctactct taactgtcct 3120 
gaatttgtta gaaactgatt tgaaagaaat gtttcttaaa attccataca cacacacaca 3180 
catattatga attgtcttta attgtagtca gaatttatta taacaatatt tgatttggac 3240 
cattttaaac attccctatt ttaaaattca tacggcttcc ttaagaagta gaatgaaagg 3300 
gtaaattggt gacatgtttg ctctctgcat tttctaacct tctaccgaat tgtgactgac 3360 
tcagagagct ctagcattta ccagtgaggt tcagaaacta aactctcagg aattcattgc 3420
atctgttgta gaagtgcttc tgggtctgag cctggcctcc tcagagtggt aatactgccc 3480 
acttcctctg gaaaataggc agggctaatg agaaactaat caaatgatta acctctgcgg 3540 
cccttcagcc tttggaatgc taaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3590 

<210> 56 
<211> 631 
<212> PRT 

<213> Mus musculus 
<400> 56 

Met Phe Asp Gly Tyr Asp Ser Cys Ser Glu Asp Thr Ser Ser Ser Ser
1 5 10 15

Ser Ser Glu Glu Ser Glu Glu Glu Val Ala Pro Leu Pro Ser Asn Leu
20 25 30

Pro Ile Ile Lys Asn Asn Gly Gln Val Tyr Thr Tyr Pro Asp Gly Lys
35 40 45

Ser Gly Met Ala Thr Cys Glu Met Cys Gly Met Val Gly Val Arg Asp
50 55 60

Ala Phe Tyr Ser Lys Thr Lys Arg Phe Cys Ser Val Ser Cys Ser Arg
65 70 75 80

Ser Tyr Ser Ser Asn Ser Lys Lys Ala Ser Ile Leu Ala Arg Leu Gln
85 90 95

Gly Lys Pro Pro Thr Lys Lys Ala Lys Val Leu Gln Lys Gln Pro Leu
100 105 110

Val Ala Lys Leu Ala Ala Tyr Ala Gln Tyr Gln Ala Thr Leu Gln Asn
115 120 125

Gln Ala Lys Thr Lys Ala Gly Asn Ser Ala Ile Ser Val Glu Gly Phe
130 135 140

Ser Trp Gly Asn Tyr Ile Asn Ser Asn Ser Phe Ile Ala Ala Pro Val
145 150 155 160

Ala Cys Phe Lys His Ala Pro Met Gly Thr Cys Trp Gly Asp Ile Ser
165 170 175

Glu Asn Val Arg Ile Glu Val Pro Asn Thr Asp Cys Ser Leu Pro Thr
180 185 190

Lys Val Phe Trp Ile Ala Gly Ile Ile Lys Leu Ala Gly Tyr Asn Ala
195 200 205

Leu Leu Arg Tyr Glu Gly Phe Glu Asn Asp Ser Ser Leu Asp Phe Trp
210 215 220

Cys Asn Ile Cys Gly Ser Asp Ile His Pro Val Gly Trp Cys Ala Ala
225 230 235 240

Ser Gly Lys Pro Leu Val Pro Pro Arg Thr Val Gln His Lys Tyr Thr
245 250 255

Asn Trp Lys Ala Phe Leu Val Lys Arg Leu Thr Gly Ala Lys Thr Leu
260 265 270

Pro Pro Asp Phe Ser Gln Lys Val Ser Glu Ser Met Gln Tyr Pro Phe
275 280 285

Lys Pro Cys Met Arg Val Glu Val Val Asp Lys Arg His Leu Cys Arg
290 295 300

Thr Arg Val Ala Val Val Glu Ser Val Ile Gly Gly Arg Leu Arg Leu
305 310 315 320

Val Tyr Glu Glu Ser Glu Asp Gly Thr Asp Asp Phe Trp Cys His Met
325 330 335

His Ser Pro Leu Ile His His Ile Gly Trp Ser Arg Ser Ile Gly His
340 345 350

Arg Phe Lys Arg Ser Asp Ile Thr Lys Lys Gln Asp Gly His Phe Asp
355 360 365

Thr Pro Pro His Leu Phe Ala Lys Val Lys Glu Val Asp Gln Ser Gly
370 375 380

Glu Trp Phe Lys Glu Gly Met Lys Leu Glu Ala Ile Asp Pro Leu Asn
385 390 395 400

Leu Ser Thr Ile Cys Val Ala Thr Ile Arg Lys Val Leu Ala Asp Gly
405 410 415

Phe Leu Met Ile Gly Ile Asp Gly Ser Glu Ala Ala Asp Gly Ser Asp
420 425 430

Trp Phe Cys Tyr His Ala Thr Ser Pro Ser Ile Phe Pro Val Gly Phe
435 440 445

Cys Glu Ile Asn Met Ile Glu Leu Thr Pro Pro Arg Gly Tyr Thr Lys
450 455 460

Leu Pro Phe Lys Trp Phe Asp Tyr Leu Arg Glu Thr Gly Ser Ile Ala
465 470 475 480

Ala Pro Val Lys Leu Phe Asn Lys Asp Val Pro Asn His Gly Phe Arg
485 490 495

Val Gly Met Lys Leu Glu Ala Val Asp Leu Met Glu Pro Arg Leu Ile
500 505 510

Cys Val Ala Thr Val Thr Arg Ile Ile His Arg Leu Leu Arg Ile His
515 520 525

Phe Asp Gly Trp Glu Glu Glu Tyr Asp Gln Trp Val Asp Cys Glu Ser
530 535 540

Pro Asp Leu Asn Pro Val Gly Trp Cys Gln Leu Thr Gly Tyr Gln Leu
545 550 555 560

Gln Pro Pro Ala Ser Gln Ser Ser Arg Glu Ser Gln Ser Ala Ser Ser
565 570 575

Lys Gln Lys Lys Lys Ala Lys Ser Gln Gln Tyr Lys Gly His Lys Lys
580 585 590

Met Thr Thr Ser Gln Leu Lys Glu Glu Leu Leu Asp Gly Glu Asp Tyr
595 600 605

Ser Phe Leu His Gly Ala Ser Asp Gln Glu Ser Asn Gly Ser Ala Thr
610 615 620

Val Tyr Ile Lys Gln Glu Pro
625 630



<210> 57 
<211> 1650 
<212> DNA 

<213> Mus musculus 
<400> 57 

ggggcttcac tctccagcct tggagagcag tcagggaagg cctccaggag gaagcagcta 60 
ccrgtggcag ggcagagcag tgcccgtacc agtctcccgg aacttcagaa ctgcggtggc 120 
catggagtct gtggagct cggtcaaaga cggcatcctc taccagcagc ac iaagtt 180
tggcaagaaa tgctggcgca aagtgtgggc tctgctgtat gcgggaggcc catcaggggt 240 
agctcggcta gaaagctggg acgtgcgtga tggtggcctg ggaccagcag gcgacaggtc 300 
cacagggccc agccgtcgag gggaacgccg ggtcatacgc ttggctgact gtgtatctgt 360 
cctgcctgcg gatggcgaga gctgtcccag ggacactggt gccttcctga ttaccaccac 420 
tgagcgaagc cacctgttgg ctgcacagca ccgccagtcc tgggtggacc ccatctgtca 480 
gctggccttc ccgggtaccg gagaatgttc gtcaggatca ggacaggctg agaatccaaa 540 
aaggggcttt gttcccatgg aagaaaactc tatctactcc tcctggcagg aagtgaccga 600 
atttccggtg atcgtgcaga agacagaagc cacctcccgc tgccagctga aaggacccta 660 
cctcctggtg ctgggccaag atgacatcca actgagggag acatccaagc cccaggcctg 720 
ttttagctgg ccctaccgtt tcctgcgcaa gtacggctct gacaagggtg tgttctcgtt 780 
tgaggctggc cgccgctgtg actcaggtga gggccttttt gccttcagta gcccgcgtgc 840 
cccagacata tgtggggttg tggctgccgc cattgcccgc cagcgggagc gtcttccaga 900 
gctggccatg tccccaccct gccccctgcc tcgggccctc tccctgccct ccctagagcc 960 
ccctggagag cttcgggagg tggccccagg atttgagctg cccactccca gaaagctgcc 1020 
tctaactgat cccgggcctc aaagcctacc attgctgctc agccccaaca caagaaggac 1080 
cggcatccgg tctctatgcg tccgtgtgca agcagaccag caagcacaca ggcacggcgg 1140
agcatttcta tgagaacgtg tgcatgctgg aggccagcct tgggctgacc aatgggggtc 1200 
ctgaagccca agagggcccc cttggtggcc gcagccccct tgggcagcct tatctaccat 1260 
aacactgagg atctgagttg gccgggctcg gcccaggaca gcaatctgga agcccagtac 1320 
cggaggctgc cggaactgga gctggatgag gccggaagcg ccggccgctc tggagcgcag 1380 
gcaggcatca aggccaagct ggtgaccctg ctgacccgtg aacggaagaa gggccccgcc 1440
ccctgtgacc ggccctgaag gcctgagcgg ccagccactg caggacagag gtgatcaccc 1500 
aagaccagga acaacttcga acataacccg tctactctga cctgcaggga caagccaggt 1560 
ggcccgggga ggagccacac tctgccctac ctcctccctc agactgtaca gattgaacag 1620 
taataaagct tgcctatcaa cttcaaaaaa 1650 

<210> 58 
<211> 359 
<212> PRT 

<213> Mus musculus 
<400> 58 

Met Glu Ser Val Glu Pro Pro Val Lys Asp Gly Ile Leu Tyr Gln Gln
1 5 10 15

His Val Lys Phe Gly Lys Lys Cys Trp Arg Lys Val Trp Ala Leu Leu
20 25 30

Tyr Ala Gly Gly Pro Ser Gly Val Ala Arg Leu Glu Ser Trp Asp Val
35 40 45

Arg Asp Gly Gly Leu Gly Pro Ala Gly Asp Arg Ser Thr Gly Pro Ser
50 55 60

Arg Arg Gly Glu Arg Arg Val Ile Arg Leu Ala Asp Cys Val Ser Val
65 70 75 80

Leu Pro Ala Asp Gly Glu Ser Cys Pro Arg Asp Thr Gly Ala Phe Leu
85 90 95

Ile Thr Thr Thr Glu Arg Ser His Leu Leu Ala Ala Gln His Arg Gln
100 105 110

Ser Trp Val Asp Pro Ile Cys Gln Leu Ala Phe Pro Gly Thr Gly Glu
115 120 125

Cys Ser Ser Gly Ser Gly Gln Ala Glu Asn Pro Lys Arg Gly Phe Val
130 135 140

Pro Met Glu Glu Asn Ser Ile Tyr Ser Ser Trp Gln Glu Val Thr Glu
145 150 155 160

Phe Pro Val Ile Val Gln Lys Thr Glu Ala Thr Ser Arg Cys Gln Leu
165 170 175

Lys Gly Pro Tyr Leu Leu Val Leu Gly Gln Asp Asp Ile Gln Leu Arg
180 185 190

Glu Thr Ser Lys Pro Gln Ala Cys Phe Ser Trp Pro Tyr Arg Phe Leu
195 200 205

Arg Lys Tyr Gly Ser Asp Lys Gly Val Phe Ser Phe Glu Ala Gly Arg
210 215 220

Arg Cys Asp Ser Gly Glu Gly Leu Phe Ala Phe Ser Ser Pro Arg Ala
225 230 235 240

Pro Asp Ile Cys Gly Val Val Ala Ala Ala Ile Ala Arg Gln Arg Glu
245 250 255

Arg Leu Pro Glu Leu Ala Met Ser Pro Pro Cys Pro Leu Pro Arg Ala
260 265 270

Leu Ser Leu Pro Ser Leu Glu Pro Pro Gly Glu Leu Arg Glu Val Ala
275 280 285

Pro Gly Phe Glu Leu Pro Thr Pro Arg Lys Leu Pro Leu Thr Asp Pro
290 295 300

Gly Pro Gln Ser Leu Pro Leu Leu Leu Ser Pro Asn Thr Arg Arg Thr
305 310 315 320

Gly Ile Arg Ser Leu Cys Val Arg Val Gln Ala Asp Gln Gln Ala His
325 330 335

Arg His Gly Gly Ala Phe Leu Glu Arg Val His Ala Gly Gly Gln Pro
340 345 350

Trp Ala Asp Gln Trp Gly Ser
355



<210> 59 

<211> 1750 

<212> DNA 

<213> Mus musculus 



<400> 59 

ggcctccgac atattgcccg caggagctgc ggcggtgagc ggagagcgcc gggggaagga 60 

gatgggagga cgaagaggtc ccaacaggac atcttactat cgaaacccgc tctgtgagcc 120 

aggatcatca ggggcctctg gtggaggcca ctcttctagt gcatccgtga gcagcgtccg 180 

ttctcgaagc aggaccactt ctgggacggg cctctccagc cctccgctgg ccgcgcagac 240 

cgttgtgcct ctacagcact gcaagatccc cgagctgccc gtccaggcca gcattctgtt 300 

cgagctacag ctcttcttct gtcagctcat agccctgttc gtgcactaca tcaacatcta 360 

taagacagtc tggtggtatc caccctcgca cccaccctcc cacacttccc tgaacttcca 420 

cctgatcgac ttcaacttgc tgatggtgac cgccattgtt ctgggccgcc gattcatcgg 480

gtccatcgtg aaggaggctt ctcagagggg gaaaggtctc cctcttccgc tccatcctgc 540 

tgttcctcac ccgcttcacg ttctcacggc gacaggctgg agtctgtgcc ggtccctcat 600 

ccacctcttc aggacctact ccttcctgaa cctcctgttc ctctgctatc cgtttgggat 660 

gtacattccg ttcctgcagc tgaactatga tcttcgcaag acgaacctct tcacccacat 720 

ggcttccatg ggaccccgag aggcagtcag tggcctggca aggagtcggg actactttct 780 

gacactgcgg gagacttgga agcagcatac acgacagcta tatggcccgg aagccatgcc 840 

cacccatgcc tgctgcttgt cacccagcct cattcgcaat gaagttgagt tcctcaaaat 900 

ggacttcaat tggcgaatga aggaagtact tgtcagctcc atgctgagtg cctactatgt 960 

ggcctttgta cctgtgtggt ttgtgaagaa tacacattac tatgacaagc gctggtcctg 1020 

tgagctcttc ctgctggtgt ccatcagtac ctccgtgatc ctcatgcagc acctgttgcc 1080 

tgccagctac tgtgacctgc tgcacaaggc cgccgcccac ctggcgtgct ggcagaaggt 1140

ggacccagct ctgtgcti atgtgctgca gccaccgtgg actgaagagt gt gtggcc 1200

acagggtgtg ctggtgaaac acagcaagaa tgtctacaaa gcagtgggcc actacaacgt 1260 

ggcatcccct ccgatgtctc ccacttcsgc ttccatttct ttttcagcaa ccccctgcgg 1320 

atcctcaaca tccttttgct gctggacggt gccgtcattg tctaccagct gtactcctta 1380

atgtcctcag agaagtggca ccagacca-c tccctggccc tcatcctctt cagcaactac 1440 

tatgccttct tcaagctgct ccgggaccgc ctggtattgg gcaaagctac tcctactcag 1500 

ccagccccca gaggacctgg accaccggtt ctcctgagtc ccagggtgat cctccaaaca 1560 

acatccgggc ttcacccagg ggcttccrgg ccaagggctc ttggggttgg gggggaggga 1620 

tacaaaaaac aaaaacaaaa aacaacaaaa aaaaaccaac ccaccagagc tttgtatttt 1680 

cgttacgtac tgtttctttc tttgatsatt gatgtggtaa gaaaaaagtc ttatttttat 1740 

actccaaaaa 1750 

<210> 60 
<211> 439 
<212> PRT 

<213> Mus musculus 
<400> 60

Met Gly Gly Arg Arg Gly Pro Asn Arg Thr Ser Tyr Tyr Arg Asn Pro
1 5 10 15

Leu Cys Glu Pro Gly Ser Ser Gly Ala Ser Gly Gly Gly His Ser Ser
20 25 30

Ser Ala Ser Val Ser Ser Val Arg Ser Arg Ser Arg Thr Thr Ser Gly
35 40 45

Thr Gly Leu Ser Ser Pro Pro Leu Ala Ala Gln Thr Val Val Pro Leu
50 55 60

Gln His Cys Lys Ile Pro Glu Leu Pro Val Gln Ala Ser Ile Leu Phe
65 70 75 80

Glu Leu Gln Leu Phe Phe Cys Gln Leu Ile Ala Leu Phe Val His Tyr
85 90 95

Ile Asn Ile Tyr Lys Thr Val Trp Trp Tyr Pro Pro Ser His Pro Pro
100 105 110

Ser His Thr Ser Leu Asn Phe His Leu Ile Asp Phe Asn Leu Leu Met
115 120 125

Val Thr Ala Ile Val Leu Gly Arg Arg Phe Ile Gly Ser Ile Val Lys
130 135 140

Glu Ala Ser Gln Arg Gly Lys Gly Leu Pro Leu Pro Leu His Pro Ala
145 150 155 160

Val Pro His Pro Leu His Val Leu Thr Ala Thr Gly Trp Ser Leu Cys
165 170 175

Arg Ser Leu Ile His Leu Phe Arg Thr Tyr Ser Phe Leu Asn Leu Leu
180 185 190

Phe Leu Cys Tyr Pro Phe Gly Met Tyr Ile Pro Phe Leu Gln Leu Asn
195 200 205

Tyr Asp Leu Arg Lys Thr Asn Leu Phe Thr His Met Ala Ser Met Gly
210 215 220

Pro Arg Glu Ala Val Ser Gly Leu Ala Arg Ser Arg Asp Tyr Phe Leu
225 230 235 240

Thr Leu Arg Glu Thr Trp Lys Gln His Thr Arg Gln Leu Tyr Gly Pro
245 250 255

Glu Ala Met Pro Thr His Ala Cys Cys Leu Ser Pro Ser Leu Ile Arg
260 265 270

Asn Glu Val Glu Phe Leu Lys Met Asp Phe Asn Trp Arg Met Lys Glu
275 280 285

Val Leu Val Ser Ser Met Leu Ser Ala Tyr Tyr Val Ala Phe Val Pro
290 295 300

Val Trp Phe Val Lys Asn Thr His Tyr Tyr Asp Lys Arg Trp Ser Cys
305 310 315 320

Glu Leu Phe Leu Leu Val Ser Ile Ser Thr Ser Val Ile Leu Met Gln
325 330 335

His Leu Leu Pro Ala Ser Tyr Cys Asp Leu Leu His Lys Ala Ala Ala
340 345 350

His Leu Ala Cys Trp Gln Lys Val Asp Pro Ala Leu Cys Ser Asn Val
355 360 365

Leu Gln Pro Pro Trp Thr Glu Glu Cys Met Trp Pro Gln Gly Val Leu
370 375 380

Val Lys His Ser Lys Asn Val Tyr Lys Ala Val Gly His Tyr Asn Val
385 390 395 400

Ala Ser Pro Pro Met Ser Pro Thr Ser Ala Ser Ile Ser Phe Ser Ala
405 410 415

Thr Pro Cys Gly Ser Ser Thr Ser Phe Cys Cys Trp Arg Val Pro Ser
420 425 430

Leu Ser Thr Ser Cys Thr Pro
435



<210> 61 

<211> 3071 

<212> DNA 

<213> Mus musculus 



<400> 61 

gtgaaagcag cagrgcgcct ctgctccctt cagagcacag cctggtgtca aggtccaggt 60
tccaccggct gctgctgtca ccgcagggga gtctagcccc tcccagaagg agacacagaa 120
gaatggccat ctcaactggt ttgttcctgc tgctggggct ccttggccag ccctgggcag 180
gggctgctgc tga^tcacag gctgtggtgt gcgaggggac tgcctgctat acagcccatt 240
ggggcaagct gagtgccgct gaagcccagc atcgctgcaa tgagaatgga ggcaarcttg 300
ccaccgtgaa gagtgaggag gaggcccggc atgttcagca agccctgact cagctcctga 360
agaccaaggc acccttggaa gcaaagatgg gcaaattctg gatcgggctc cagcgagaga 420
agggcaactg tacgtaccat gatttgccaa tgaggggctt cagctgggtg ggtggtggag 480
aggacacagc ttattcaaac tggtacaaag ccagcaagag ctcctgtatc tttaaacgct 540
gtgtgtccct catactggac ctgtccttga cacctcaccc cagccatctg cccaagtggc 600
atgagagtcc ctgtgggacc cccgaagctc caggtaacag cattgaaggt ttcctgtgca 660
sgttcaactt caaaggcatg tgtaggccac tggcgctggg tggtccaggg cgggcgacct 720
ataccacccc tttccaggcc actacctcct ctctggaggc tgtgcctttt gcctcrgtag 780
ccaatgtagc ttgtggggat gaagctaaga gtgaaaccca ctatttccta tgcaatgaaa 840
agactccagg aatatttcac tggggcagct caggcccact ctgtgtcagc cccaagtttg 900
gttgcagttt caacaacggg ggctgccagc aggattgctt cgaaggtggc gatggctcct 960
tccgctgcgg ctgccggcct ggatttcgac tgctggatga tctagtaact tgtgcctcca 1020
ggaacccctg cagctcaaac ccatgcacag gaggtggcat gtgccattct gtaccactca 1080
grgaaaacta cacrtgccgt tgtcccagcg gctaccagct ggactctagc caagtgcact 1140






gtgtggatat agatgagi. caggactccc cctgtgccca ggattgtgtc ac itctag 1200
ggagcttcca ctgrgaatgt tgggttggtt accaacccag tggccccaag gaagaggcct 1260
gtgaagatgt ggatgagtgt gcagctgcca actcgccctg tgcccaaggc tgcatcaaca 1320
ctgatggctc t-tctactgc tcctgtaaag agggctatat tgtgtctggg gaagacagta 1380
cccagtgtga ggatatagat gagtgttcgg acgcaagggg caatccatgt gattccctgt 1440
gcttcaacac agacggttcc ttcaggtgtg gctgcccgcc aggctgggag ctggctccca 1500
atggggtctt ttgtagcagg ggcactgtgt tttctgaact accagccagg cctccccaaa 1560
aggaagacaa cgatgacaga aaggagagta ctatgcctcc tactgaaatg cccagttctc 1620
ctagtggctc taaggatgtc tccaacagag cacagacaac aggtctcttc gtccaatcag 1680
atattcccac tgcctctgtt ccactagaaa tagaaatccc tagtgaagta tctgatgtct 1740
ggttcgagtt gggcacatac ctccccacga cctccggcca cagcaagccg acacatgaag 1800
attctgtgtc tccacacagt gacaccgatg ggcagaacct gcttctgttt tacatcctgg 1860
ggacggtggt ggccatctca ctcttgctgg tgctggccct agggattctc atttatcata 1920
aacggagagc caagaaggag gagataaaag agaagaagcc tcagaatgca gccgacagct 1980
attcctgggt tccagagcga gcagagagcc aagccccgga gaatcagtac agcccaacac 2040
cagggacaga ctgctgaaga ctatgtggcc ttagagacag ctgccactac cttcagagct 2100
accttcttag argaggggga agccacatca ttctgaatga cttgactgga ctctcagcaa 2160
aaaaattgtg caccttccac ttaagaacct ggtggcttgg gataggcagg tattttcttg 2220
gtgcctttga tatgtctggg ggtgaaagct gtgtgttggt ttgtcattgt ggggagtttt 2280
gtggatattg acagacctca ctcaaacacc cttttcaaat ccaatagcaa ctggttcctc 2340
tggttcctaa tragggggaa aggagtcaga ggggtgggac agggtggggg gatggggctt 2400
caaagttttt tcttatcact tgatttatca tcgaaggagt tactggtgct aattacaatg 2460
gaaacagttc crttccatca caggacagac acacctcaat cctccatggg gtcaacaact 2520
atataccccc agtgacccct taggcaagga cttgttgaga actgcatcac attttgacct 2580
gttctcaaca gtacccatct atttcaggtg ggatctctgg acctttcctc cttcccatct 2640
tgtctgcaat gtggcaaatg gcttcttttt gcatttttac tccgccccca ccccaagctg 2700
aagttcattt gcagatcagc gattaagtct gaattgtgtg gtggtcagtc ttgtttcctt 2760
ttgtcagggg ttattgtaaa tgttagtaat ttcgcctcaa gccctcagta agaacataaa 2820
tattttaaaa tatgtgcgtt tgaaatctgt ttcatgcatc ctggaactgt gggatgctca 2880
ggcaagagtg actttagtct ttcagtgaat gttgcccaga atgtgggtag ggaaggctca 2940
caggttactc tcctccttag agctacaaca taacattctg aggggagtca cagggttgcc 3000
tttaaaaagt gggagctatg tcatgctttg agctttctgt taagcacctc tcctaataaa 3060
ctctgaaaaa a 3071



<210> 62 
<211> 644 
<212> PRT 

<213> Mus musculus 



<400> 62 
Met Ala Ile Ser Thr Gly Leu Phe Leu Leu Leu Gly Leu Leu Gly Gln 
1 5 10 15 

Pro Trp Ala Gly Ala Ala Ala Asp Ser Gln Ala Val Val Cys Glu Gly 
20 25 30 

Thr Ala Cys Tyr Thr Ala His Trp Gly Lys Leu Ser Ala Ala Glu Ala 
35 40 45 

Gln His Arg Cys Asn Glu Asn Gly Gly Asn Leu Ala Thr Val Lys Ser 
50 55 60 

Glu Glu Glu Ala Arg His Val Gln Gln Ala Leu Thr Gln Leu Leu Lys 
65 70 75 80 

Thr Lys Ala Pro Leu Glu Ala Lys Met Gly Lys Phe Trp Ile Gly Leu 
85 90 95 

Gln Arg Glu Lys Gly Asn Cys Thr Tyr His Asp Leu Pro Met Arg Gly 
100 105 110 

Phe Ser Trp Val Gly Gly Gly Glu Asp Thr Ala Tyr Ser Asn Trp Tyr 
115 120 125 
Lys 



Ala 
130 



Ser 



Lys 



Ser 



Ser 



Cys 
135 



Phe 



Lys 



Arg 



Cys 
140 



Val 



Ser 



Ile 



Leu Asp Leu Ser Leu Thr Pro His Pro Ser His Leu Pro Lys Trp His 
145 150 155 160 

Glu Ser Pro Cys Gly Thr Pro Glu Ala Pro Gly Asn Ser Ile Glu Gly 
165 170 175 

Phe Leu Cys Lys Phe Asn Phe Lys Gly Met Cys Arg Pro Leu Ala Leu 
180 185 190 

Gly Gly Pro Gly Arg Val Thr Tyr Thr Thr Pro Phe Gln Ala Thr Thr 
195 200 205 

Ser Ser Leu Glu Ala Val Pro Phe Ala Ser Val Ala Asn Val Ala Cys 
210 215 220 

Gly Asp Glu Ala Lys Ser Glu Thr His Tyr Phe Leu Cys Asn Glu Lys 
225 230 235 240 

Thr Pro Gly Ile Phe His Trp Gly Ser Ser Gly Pro Leu Cys Val Ser 
245 250 255 

Pro Lys Phe Gly Cys Ser Phe Asn Asn Gly Gly Cys Gln Gln Asp Cys 
260 265 270 

Phe Glu Gly Gly Asp Gly Ser Phe Arg Cys Gly Cys Arg Pro Gly Phe 
275 280 285 

Arg Leu Leu Asp Asp Leu Val Thr Cys Ala Ser Arg Asn Pro Cys Ser 
290 295 300 

Ser Asn Pro Cys Thr Gly Gly Gly Met Cys His Ser Val Pro Leu Ser 
305 310 315 320 

Glu Asn Tyr Thr Cys Arg Cys Pro Ser Gly Tyr Gln Leu Asp Ser Ser 
325 330 335 

Gln Val His Cys Val Asp Ile Asp Glu Cys Gln Asp Ser Pro Cys Ala 
340 345 350 

Gln Asp Cys Val Asn Thr Leu Gly Ser Phe His Cys Glu Cys Trp Val 
355 360 365 

Gly Tyr Gln Pro Ser Gly Pro Lys Glu Glu Ala Cys Glu Asp Val Asp 
370 375 380 

Glu Cys Ala Ala Ala Asn Ser Pro Cys Ala Gln Gly Cys Ile Asn Thr 
385 390 395 400 

Asp Gly Ser Phe Tyr Cys Ser Cys Lys Glu Gly Tyr Ile Val Ser Gly 
405 410 415 

Glu Asp Ser Thr Gln Cys Glu Asp Ile Asp Glu Cys Ser Asp Ala Arg 
420 425 430 

Gly Asn Pro Cys Asp Ser Leu Cys Phe Asn Thr Asp Gly Ser Phe Arg 
435 440 445 

Cys Gly Cys Pro Pro Gly Trp Glu Leu Ala Pro Asn Gly Val Phe Cys 
450 455 460 

Ser Arg Gly Thr Val Phe Ser Glu Leu Pro Ala Arg Pro Pro Gln Lys 
465 470 475 



Glu Asp Asn Asp Asp Arg Lys Glu Ser Thr Met Pro Pro Thr Glu 
485 490 495 

Pro Ser Ser Pro Ser Gly Ser Lys Asp Val Ser Asn Arg Ala Gln 
500 505 510 

Thr Gly Leu Phe Val Gln Ser Asp Ile Pro Thr Ala Ser Val Pro 
515 520 525 

Glu Ile Glu Ile Pro Ser Glu Val Ser Asp Val Trp Phe Glu Leu 
530 535 540 

Thr Tyr Leu Pro Thr Thr Ser Gly His Ser Lys Pro Thr His Glu 
545 550 555 

Ser Val Ser Ala His Ser Asp Thr Asp Gly Gln Asn Leu Leu Leu 
565 570 575 

Tyr Ile Leu Gly Thr Val Val Ala Ile Ser Leu Leu Leu Val Leu 
580 585 590 

Leu Gly Ile Leu Ile Tyr His Lys Arg Arg Ala Lys Lys Glu Glu 
595 600 605 

Lys Glu Lys Lys Pro Gln Asn Ala Ala Asp Ser Tyr Ser Trp Val 
610 615 620 

Glu Arg Ala Glu Ser Gln Ala Pro Glu Asn Gln Tyr Ser Pro Thr 
625 630 635 



480 

Met 

Thr 



Gly 



Asp 
560 



Ile 



Pro 
640 



Gly Thr Asp Cys 



<210> 63 

<211> 499 

<212> DNA 

<213> Mus musculus 



<400> 63 

cccattactt ccacctccca tgaggactgc acctgcggca gcgattttat gtgaacttgg 60 
attactgtat ggaaatggag gagtgagaaa gtgtgggaat cataaaagaa ggcagaacca 120 
aaagacagag ggatcttggg ccatgagtca cagcaaggaa agcctcccac caaacatctg 180 
cactggactc ctacaggaac aagaaatgaa ctatcgtgtt cagcccctga gacttgggct 240 
taacatgatt ttcacattgt ctaccctacc taatagagca gagatgtaaa tattattctt 300 
attttagagg tgtgatgcct cagctgcaat gggtgagaac tactcctcat ttattctctt 360 
ccaaggcaat aaaagagaat ggaccaaaga cagtctgtca tcacatctag tcaaaagagc 420 
taatgtcgca gtacaactct tcaaaagaaa aagaaaaaac aagaaaaaag taataaacag 480 
atgtgttctg ctcgaaaaa 499 



<210> 64 
<211> 89 
<212> PRT 

<213> Mus musculus 



<400> 64 

Met Arg Thr Ala Pro Ala Ala Ala Ile Leu Cys Glu Leu Gly Leu Leu 
1 5 10 15 

Tyr Gly Asn Gly Gly Val Arg Lys Cys Gly Asn His Lys Arg Arg Gln 
20 25 30 

Asn Gln Lys Thr Glu Gly Ser Trp Ala Met Ser His Ser Lys Glu Ser 
35 40 45 

Leu Pro Pro Asn Ile Cys Thr Gly Leu Leu Gln Glu Gln Glu Met Asn 
50 55 60 

Tyr Arg Val Gln Pro Leu Arg Leu Gly Leu Asn Met Ile Phe Thr Leu 
65 70 75 80 

Ser Thr Leu Pro Asn Arg Ala Glu Met 
85 



<210> 65 
<211> 3857 
<212> DNA 

<213> Mus musculus 
<400> 65 

agagggtaga cagaaaggcg ggaaagggct gtgaggtcaa atggacccca tggaactgaa 60 
caacgtcagc atcgaacccg acggagacag ctgcagcggg gacagtattc aggacagc-a 120 
caccggcatg gaaaac-ccg acaaggacgc catgaacagc caatttgcta atgaagatgc 180 
cgaaagtcag aagttcctga caaatgggtt tttagggaag aagaagctag ccgattacgc 240 
ggatgagcat caccctggaa tgacttcctt tggaatgtcc tcatttaacc tgagcaacgc 300 
catcatgggc agtgggatct taggcttgtc ctatgccatg gccaacaccg ggatcatcct 360 
ttttataatc atgctgctta ctgtggcaat actctcgctc tactcggttc accttttgzt 420 
gaagacagcc aaggaaggag ggtctctaat ctatgaaaaa ttgggcgaga aagcattrgg 480 
atggcctggg aaaattggag ccttcatctc tattacaatg cagaacattg gagccatgtc 540 
aagctacctc ttcatcarta agtacgaact gcctgaagta atcagagcat tcatgggact 600 
tgaagaaaac actggggaat ggtacctcaa cggcaactac ctcgtcttat ttgtgtccgt 660 
ggggatcatc ctcccgctct ctctccttaa aaatttaggc taccttggct acaccagtgg 720 
attt-ctctc tcctgcatgg tgtttttcgt cagtgtggtg atttacaaaa aattccaaat 780 
tccczgccct ctgcctgctc tggaccacaa caacggaaat ctgacgttca acaacacact 840 
tccgattcac atgatctccc tgcctaatga ctcggagagc tcgggtgtga acttcatgat 900 
ggattacgct caccacaacc cagctgggct ggatgagaag caggtcgcag gccctcttca 960 
cagcaatggc gtggag^acg aagcccaggg tgctgagaaa tgccaaccaa aatactttg- 1020 
gttcaattcc cggacggsct atgcaatccc aatcctggct tttgcttttg tctgccaccc 1080 
tgaggtcctt cccatc-aca gcgagcttaa agatcgatcc cgcagaaaga tgcagacggt 1140 
gtccaacatt tccatcicag gca-gctcgt catgtacctt cttgcggccc tctttggt.a 1200 
tctgagcttc tacggggacg ttgaagacga gctgctgcat gcttacagca aggtctacac 1260 
atttgatacg gctcrcctca tggtgcgcct ggcagtcctg gtggcagtga cactgaccg- 1320 
gcccatcgtg ctgttcccga tccgtacttc ggtgatcaca ctgctgtttc caaggaaacc 1380 
cttcagctgg ctgaagcatt tcgggatcgc tgcaatcatc atcgcactca acaacatccr 1440 
ggtcatcctc gtgccta=ea tcaaatacat ctttggattc ataggggctt cttctgccac 1500 
tatgctgatt ttcattct-c cggctgcgtt ttatctcaag ctcgtcaaga aagaacctct 1560 
aagatcaccc cagaaga-^g gggctttggt cttccttgtg actggaatta ttttcatgat 1620 
gggaagcatg gcgctcarta tactcgactg gatctacaac ccgccgaatc ccaatcacca 1680 
ctaatcccgg ggagacgcgt ctccactgga aacagctgaa attgtctgaa ggacatttta 1740 
gttgtcttga ttggga-gtt agtctgagga attagcaaga ttccaaagac gtttttccag 1800 
ctctatcagc acacatttta acccaggccg tgcagtgcag tgtgtgatgc ccgagttg~g 1860 
tttgcagcag ctgtgcaagc tgaagcctgt tggctgcgtg tgttggtcag cagacaatag 1920 
cctgtzcccc catggtca=t ccacttctct ccacccccag attaacaggt aattctactc 1980 
tcagaacatc agacaaagac ctcctggttg ggatacttgt ggaagagaaa attatgggtt 2040 
ttgtzgggaa tggttttgtt gggaatggtg aaggatgcat taaaaattct gtgcgaagta 2100 
tcatcagtta cggccatcrc tcactctaca ccaacactaa gggtcggttg actagctgag 2160 
gcagggggat atcttgggct gtccctgtga ggatcatgac gtatgacggt tgccagtata 2220 
gagtacrtca tttcaatac- caaggaatag tttgcccaac ctgcttatta caccgagt~a 2280 
gtgaattttc attctrtt-a ttttttggta tggcaaatgt tcaaacatct cactaaacat 2340 
agaggggggt tatttagtrt atcatgaaac aactaccaaa aagtaatggt ttttaacatc 2400 
tgccttttca tgttgttrct gatatctttg ttccactttg tctttgaaca aggcttcccc 2460 
tctcgtcttc cgttctcact cttcgtaatt ttcagaaaaa ccacagtccg tgtgggagac 2520 
acac:a:ccc agtattgt-t gatacatctc -tatttgataa acattcagcg caggaaactg 2580 
tgattttgct atatgtt.gt gtacataatc tcattctgca gttatcagaa cgttgacata 2640 
tgggacattg gatttt.a-t ttttacatat gtaggttttt tttcttcaca gacaaaatg- 2700 
ttatatcatc agggtggggc agggaattaa gctggtgggc tcaaaaatcc atgggtactt 2760 
atctgtccat tggagacatc taaaaattaa agtcaaagtt gtgcatagtt cagtaacgct 2820 
cttcactgtt tacaagacta taatcatctc agcaaagtag cgaaggaagg gtttgtaaat 2880 
aattttcagt gctaacaatg gtctttgaat ttaacatgtc tagaataggg tttagttcat 2940 
tttagtttac atcggaactt ggtgacattc atttgccatt aatagaacat cgtgtggtta 3000 
ggaataaatg caccaacatg acccaaaacc acattgctca catccatgga cgcttacacg 3060 
ggatggagtt cagactccgc ccactgccta gaatctctgg cattgacttc tttttttcaa 3120 
ttactggttg tacattttac atttcagtaa aaagccaagt ctgaaatgca agtatcctac 3180 
aggtagtgat atactgtcac accaaagcag tttttcaaca gtgtgcctaa gaaacatata 3240 
tttgtttact gtgtcatttt ctggacctta agactataca atggtgtgct ttgtgataga 3300 
aattgagcat tactagaaac taccataact tcggagrtat tatcatttgg gagggaaaaa 3360 
aatcaccaaa taggatcttt aagatgtctt tctattttca ttcatccaag aaataaatca 3420 
agatagaaag cagaaagggt gtgctgcaaa tattaaatta ttattttatc tgaacatatt 3480 
aatgtgaata taaaagatta atccaaaaat attggaaacc tgtgaaatct gtacgtattt 3540 
cggatcttta cacttagagc agctaacctg gaagtatcca cttgtgtagt tagatattca 3600 
agagcaatat cacagtgcca aagacctcat tcacacagca ccgatcattt gtgtggccgt 3660 
tttcaggatt agagttcaca cagactccag ggcagatact cagaataccg ttttctggta 3720 
aatttaaccc atttgtaaac agatacaaat tttattttct tacatcattt ataagacggt 3780 
tcaatgcact ggggggattt ttttcattac agactgtgta ttggtatata ataaataaac 3840 
ttttcaaatg aaaaaaa 3857 



<210> 66 
<211> 547 
<212> PRT 

<213> Mus musculus 
<400> 66 

Met Asp Pro Met Glu Leu Asn Asn Val Ser Ile Glu Pro Asp Gly Asp 
1 5 10 15 

Ser Cys Ser Gly Asp Ser Ile Gln Asp Ser Tyr Thr Gly Met Glu Asn 
20 25 30 

Ser Asp Lys Asp Ala Met Asn Ser Gln Phe Ala Asn Glu Asp Ala Glu 
35 40 45 

Ser Gln Lys Phe Leu Thr Asn Gly Phe Leu Gly Lys Lys Lys Leu Ala 
50 55 60 

Asp Tyr Ala Asp Glu His His Pro Gly Met Thr Ser Phe Gly Met Ser 
65 70 75 80 

Ser Phe Asn Leu Ser Asn Ala Ile Met Gly Ser Gly Ile Leu Gly Leu 
85 90 95 

Ser Tyr Ala Met Ala Asn Thr Gly Ile Ile Leu Phe Ile Ile Met Leu 
100 105 110 

Leu Thr Val Ala Ile Leu Ser Leu Tyr Ser Val His Leu Leu Leu Lys 
115 120 125 

Thr Ala Lys Glu Gly Gly Ser Leu Ile Tyr Glu Lys Leu Gly Glu Lys 
130 135 140 

Ala Phe Gly Trp Pro Gly Lys Ile Gly Ala Phe Ile Ser Ile Thr Met 
145 150 155 160 

Gln Asn Ile Gly Ala Met Ser Ser Tyr Leu Phe Ile Ile Lys Tyr Glu 
165 170 175 

Leu Pro Glu Val Ile Arg Ala Phe Met Gly Leu Glu Glu Asn Thr Gly 
180 185 190 

Glu Trp Tyr Leu Asn Gly Asn Tyr Leu Val Leu Phe Val Ser Val Gly 
195 200 205 

Ile Ile Leu Pro Leu Ser Leu Leu Lys Asn Leu Gly Tyr Leu Gly Tyr 
210 215 220 

Thr Ser Gly Phe Ser Leu Ser Cys Met Val Phe Phe Val Ser Val Val 
225 230 235 240 

Ile Tyr Lys Lys Phe Gln Ile Pro Cys Pro Leu Pro Ala Leu Asp His 
245 250 255 

Asn Asn Gly Asn Leu Thr Phe Asn Asn Thr Leu Pro Ile His Met Ile 
260 265 270 

Ser Leu Pro Asn Asp Ser Glu Ser Ser Gly Val Asn Phe Met Met Asp 
275 280 285 

Tyr Ala His His Asn Pro Ala Gly Leu Asp Glu Lys Gln Val Ala Gly 
290 295 300 

Pro Leu His Ser Asn Gly Val Glu Tyr Glu Ala Gln Gly Ala Glu Lys 
305 310 315 320 

Cys Gln Pro Lys Tyr Phe Val Phe Asn Ser Arg Thr Ala Tyr Ala Ile 
325 330 335 

Pro Ile Leu Ala Phe Ala Phe Val Cys His Pro Glu Val Leu Pro Ile 
340 345 350 

Tyr Ser Glu Leu Lys Asp Arg Ser Arg Arg Lys Met Gln Thr Val Ser 
355 360 365 

Asn Ile Ser Ile Ser Gly Met Leu Val Met Tyr Leu Leu Ala Ala Leu 
370 375 380 

Phe Gly Tyr Leu Ser Phe Tyr Gly Asp Val Glu Asp Glu Leu Leu His 
385 390 395 400 

Ala Tyr Ser Lys Val Tyr Thr Phe Asp Thr Ala Leu Leu Met Val Arg 
405 410 415 

Leu Ala Val Leu Val Ala Val Thr Leu Thr Val Pro Ile Val Leu Phe 
420 425 430 

Pro Ile Arg Thr Ser Val Ile Thr Leu Leu Phe Pro Arg Lys Pro Phe 
435 440 445 

Ser Trp Leu Lys His Phe Gly Ile Ala Ala Ile Ile Ile Ala Leu Asn 
450 455 460 

Asn Ile Leu Val Ile Leu Val Pro Thr Ile Lys Tyr Ile Phe Gly Phe 
465 470 475 480 

Ile Gly Ala Ser Ser Ala Thr Met Leu Ile Phe Ile Leu Pro Ala Ala 
485 490 495 

Phe Tyr Leu Lys Leu Val Lys Lys Glu Pro Leu Arg Ser Pro Gln Lys 
500 505 510 

Ile Gly Ala Leu Val Phe Leu Val Thr Gly Ile Ile Phe Met Met Gly 
515 520 525 

Ser Met Ala Leu Ile Ile Leu Asp Trp Ile Tyr Asn Pro Pro Asn Pro 
530 535 540 

Asn His His 
545 



<210> 67 
<211> 1070 
<212> DNA 

<213> Mus musculus 
<400> 67 

agcgacatgg ccccgcccgc gctccaggcc cagcctccag gcggctctca actgaggttc 60 

ctgctgttcc tgctgctgtt gctgctgctg ctgtcatggc catcgcaggg ggacgccctg 120 

gcaatgcctg aacagcgacc ctccggccct gagtcccaac tcaacgccga cgagctacgg 180 

ggtcgcttcc aggacctgct gagccggctg catgccaacc agagccgaga ggactcgaac 240 

tcagaaccaa gtcctgaccc agctgtccgg atactcagtc cagaggtgag attggggtcc 300 

cacggccagc tgctactccg cgtcaaccgg gcgtcgctga gtcagggtct ccccgaagcc 360 

taccgcgtgc accgagcgct gctcctgctg acgccgacgg cccgcccctg ggacatcact 420 

aggcccctga agcgtgcgct cagcctccgg ggaccccgtg ctcccgcatt acgcctgcgc 480 

ctgacgccgc ctccggacct ggctatgctg ccctctggcg gcacgcagct ggaactgcgc 540 

ttacgggtag ccgccggcag ggggcgccga agcgcgcatg cgcacccaag agactcgtgc 600 

ccactgggtc cggggcgctg ctgtcacttg gagactgtgc aggcaactct tgaagacttg 660 

ggctggagcg actgggtgct gtccccgcgc cagctgcagc tgagcatgtg cgtgggcgag 720 

tgcccccacc tgtatcgctc cgcgaacacg catgcgcaga tcaaagcacg cctgcacggc 780 

ctgcagcctg acaaggtgcc tgccccgtgc tgtgtcccct ccagctacac cccggtggtt 840 

cttatgcaca ggacagacag tggtgtgtca ctgcagactt atgatgacct ggtggcccgg 900 

ggctgccact gcgcttgagc accgggccct gctcctcacc tacactcccc ttcaaggatg 960 

ctatttatat ttgtatttat taatattatt aatttattgg ggtcgggctg ggtggatgga 1020 

ttgtgtattt atttaaaact ctgctaataa aggtgagctt ggtttcaaaa 1070 

<210> 68 
<211> 298 
<212> PRT 

<213> Mus musculus 
<400> 68 

Met Ala Pro Pro Ala Leu Gln Ala Gln Pro Pro Gly Gly Ser Gln Leu 
1 5 10 15 

Arg Phe Leu Leu Phe Leu Leu Leu Leu Leu Leu Leu Leu Ser Trp Pro 
20 25 30 

Ser Gln Gly Asp Ala Leu Ala Met Pro Glu Gln Arg Pro Ser Gly Pro 
35 40 45 

Glu Ser Gln Leu Asn Ala Asp Glu Leu Arg Gly Arg Phe Gln Asp Leu 
50 55 60 

Leu Ser Arg Leu His Ala Asn Gln Ser Arg Glu Asp Ser Asn Ser Glu 
65 70 75 80 

Pro Ser Pro Asp Pro Ala Val Arg Ile Leu Ser Pro Glu Val Arg Leu 
85 90 95 

Gly Ser His Gly Gln Leu Leu Leu Arg Val Asn Arg Ala Ser Leu Ser 
100 105 110 

Gln Gly Leu Pro Glu Ala Tyr Arg Val His Arg Ala Leu Leu Leu Leu 
115 120 125 

Thr Pro Thr Ala Arg Pro Trp Asp Ile Thr Arg Pro Leu Lys Arg Ala 
130 135 140 

Leu Ser Leu Arg Gly Pro Arg Ala Pro Ala Leu Arg Leu Arg Leu Thr 
145 150 155 160 

Pro Pro Pro Asp Leu Ala Met Leu Pro Ser Gly Gly Thr Gln Leu Glu 
165 170 175 

Leu Arg Leu Arg Val Ala Ala Gly Arg Gly Arg Arg Ser Ala His Ala 
180 185 190 

His Pro Arg Asp Ser Cys Pro Leu Gly Pro Gly Arg Cys Cys His Leu 
195 200 205 

Glu Thr Val Gln Ala Thr Leu Glu Asp Leu Gly Trp Ser Asp Trp Val 
210 215 220 

Leu Ser Pro Arg Gln Leu Gln Leu Ser Met Cys Val Gly Glu Cys Pro 
225 230 235 240 

His Leu Tyr Arg Ser Ala Asn Thr His Ala Gln Ile Lys Ala Arg Leu 
245 250 255 

His Gly Leu Gln Pro Asp Lys Val Pro Ala Pro Cys Cys Val Pro Ser 
260 265 270 

Ser Tyr Thr Pro Val Val Leu Met His Arg Thr Asp Ser Gly Val Ser 
275 280 285 

Leu Gln Thr Tyr Asp Asp Leu Val Ala Arg 
290 295 



<210> 69 
<211> 134 
<212> PRT 

<213> Mus musculus 
<400> 69 

Ala Lys Tyr Val Cys Leu Ala Asn Lys Asn Cys Pro Val Asp Lys Arg 
1 5 10 15 

Arg Arg Asn Arg Cys Gln Tyr Cys Arg Phe Gln Lys Cys Leu Ala Val 
20 25 30 

Gly Met Val Lys Glu Val Val Arg Thr Asp Ser Leu Lys Gly Arg Arg 
35 40 45 

Gly Arg Leu Pro Ser Lys Pro Lys Ser Pro Gln Glu Pro Ser Pro Pro 
50 55 60 

Ser Pro Pro Phe Gln Ala Asn Pro Asp Tyr Gln Met Ser Gly Asp Asp 
65 70 75 80 

Thr Gln His Ile Gln Gln Phe Tyr Asp Leu Leu Thr Gly Ser Met Glu 
85 90 95 

Ile Ile Arg Gly Trp Ala Glu Lys Ile Pro Gly Phe Ala Asp Leu Pro 
100 105 110 

Lys Ala Asp Gln Asp Leu Leu Phe Glu Ser Ala Phe Leu Glu Leu Phe 
115 120 125 

Val Leu Arg Leu Ala Tyr 
130 
<210> 70 

<211> 140 

<212> PRT 

<213> Mus musculus 

<400> 70 

Ser Gln Thr Arg Gln Gln Gly Pro Leu Arg Ser Ile Met Lys Asp Leu 
1 5 10 15 

His Ser Asp Asp Asn Glu Glu Glu Ser Asp Glu Val Glu Asp Asn Asp 
20 25 30 

Asn Asp Ser Glu Met Glu Arg Pro Val Asn Arg Gly Xaa Ser Arg Ser 
35 40 45 

Arg Arg Val Ser Leu Ser Asp Gly Ser Asp Ser Glu Ser Ser Ser Ala 
50 55 60 

Ser Ser Pro Leu His His Glu Pro Pro Pro Pro Leu Leu Lys Thr Asn 
65 70 75 80 

Asn Asn Gln Ile Leu Glu Val Lys Ser Pro Ile Lys Gln Ser Lys Ser 
85 90 95 

Asp Lys Gln Ile Lys Asn Gly Glu Cys Asp Lys Ala Tyr Leu Asp Glu 
100 105 110 

Leu Val Glu Leu Pro Arg Xaa Xaa Met Thr Leu Arg Glu Arg His Ile 
115 120 125 

Leu Gln Gln Ile Val Asn Leu Xaa Glu Glu Thr Gly 
130 135 140 



<210> 71 

<211> 89 

<212> PRT 

<213> Mus musculus 

<400> 71 

Met Arg Thr Ala Pro Ala Ala Ala Ile Leu Cys Glu Leu Gly Leu Leu 
1 5 10 15 

Tyr Gly Asn Gly Gly Val Arg Lys Cys Gly Asn His Lys Arg Arg Gln 
20 25 30 

Asn Gln Lys Thr Glu Gly Ser Trp Ala Met Ser His Ser Lys Glu Ser 
35 40 45 

Leu Pro Pro Asn Ile Cys Thr Gly Leu Leu Gln Glu Gln Glu Met Asn 
50 55 60 

Tyr Arg Val Gln Pro Leu Arg Leu Gly Leu Asn Met Ile Phe Thr Leu 
65 70 75 80 

Ser Thr Leu Pro Asn Arg Ala Glu Met 
85 



<210> 72 

<211> 3121 

<212> DNA 

<213> Homo sapiens 

<400> 72 

gaaagcagca gtgcgcctct gctcccttca gagcacagcc tggtgtcaag gtccaggttc 60 

caccggctgc tgctgtcacc gcaggggagt ctagcccctc ccagaaggag acacagaaga 120 

atggccatct caactggttt gttcctgctg ctggggctcc ttggccagcc ctgggcaggg 180 

gctgctgctg attcacaggc tgtggtgtgc gaggggactg cctgctatac agcccattgg 240 

ggcaagctga gtgccgctga agcccagcat cgctgcaatg agaatggagg caatcttgcc 300 

accgtgaaga gtgaggagga ggcccggcat gttcagcaag ccctgactca gctcctgaag 360 

accaaggcac ccttggaagc aaagatgggc aaattctgga tcgggctcca gcgagagaag 420 

ggcaactgta cgtaccatga tttgccaatg aggggcttca gctgggtggg tggtggagag 480 

gacacagctt attcaaactg gtacaaagcc agcaagagct cctgtatctt taaacgctgt 540 

gcgtccctca tactggacct gtccttgaca cctcacccca gccatctgcc caagtggcat 600 

gagagtccct gtgggacccc cgaagctcca ggtaacagca ttgaaggttt cctgtgcaag 660 

ttcaacttca aaggcatgtg taggccactg gcgctgggtg gtccagggcg ggtgacctat 720 

accacccctt tccaggccac tacctcctct ctggaggctg tgccttttgc ctctgtagcc 780 

aatgtagctt gtggggatga agctaagagt gaaacccact atttcctatg caatgaaaag 840 

actccaggaa tatttcactg gggcagctca ggcccactct gtgtcagccc caagtttggt 900 

tgcagtttca acaacggggg ctgccagcag gattgcttcg aaggtggcga tggctccttc 960 

cgccgcggct gccggcctgg atttcgactg ctggatgatc tagtaacttg tgcctccagg 1020 

aacccctgca gctcaaaccc atgcacagga ggtggcatgt gccantctgt accactcagt 1080 

gaaaactaca cttgccgttg tcccagcggc taccagctgg actctagcca agtgcactgt 1140 

gtggatatag atgagtgcca ggactccccc tgtgcccagg attgtgtcaa cactctaggg 1200 

agcctccact gtgaatgttg ggttggttac caacccagtg gccccaagga agaggcctgt 1260 

gaagatgtgg atgagtgtgc agctgccaac tcgccctgtg cccaaggctg catcaacact 1320 

gatggctctt tctactgctc ctgtaaagag ggctatattg tgtctgggga agacagtacc 1380 

cagtgtgagg atatagatga gtgttcggac gcaaggggca atccatgtga ttccctgtgc 1440 

ttcaacacag atggttcctt caggtgtggc tgcccgccag gctgggagct ggctcccaat 1500 

ggggtctttt gtagcagggg cactgtgttt tctgaactac cagccaggcc tccccaaaag 1560 

gaagacaacg atgacagaaa ggagagtact atgcctccta ctgaaatgcc cagttctcct 1620 

agtggctcta aggatgtctc caacagagca cagacaacag gtctcttcgt ccaatcagat 1680 

attcccactg cctctgttcc actagaaata gaaatcccta gtgaagtatc tgatgtctgg 1740 

ttcgagttgg gcacatacct ccccacgacc tccggccaca gcaagccgac acatgaagat 1800 

tctgtgtctg cacacagtga caccgatggg cagaacctgc ttctgtttta catcctgggg 1860 

acggtggtgg ccatctcact cttgctggtg ctggccctag ggattctcat ttatcataaa 1920 

cggagagcca agaaggagga gataaaagag aagaagcctc agaatgcagc cgacagctat 1980 

tcctgggttc cagagcgagc agagagccaa gccccggaga atcagcacag cccaacacca 2040 

gggacagact gctgaagact atgtggcctt agagacagct gccactacct tcagagctac 2100 

cttcttagat gagggggaag ccacatcatt ctgaatgact tgactggact ctcagcaaaa 2160 

aaattgtgca ccttccactt aagaacctgg tggcttggga taggcaggta ttttcttggt 2220 

gcctttgata tgtctggggg tgaaagctgt gtgttggttt gtcattgtgg ggagttttgt 2280 

ggatattgac agacctcact caaacaccct tttcaaatcc aatagcaact ggttcctctg 2340 

gctcctaatt agggggaaag gagtcagagg ggtgggacag ggtgggggga tggggcttca 2400 

aagttttttc ttatcacttg atttatcatc gaaggagtta ctggtgctaa ttacaatgga 2460 

aacagttcct ttccatcaca ggacagacac acctcaatcc tccatggggt caacaactat 2520 

atacccccag tgacccctta ggcaaggact tgttgagaac tgcatcacat tttgacctgt 2580 

rctcaacagt acccatctat ttcaggtggg atctctggac ctttccccct tcccatcttg 2640 

tctgcaatgt ggcaaatggc ttctttttgc atttttactc cgcccccacc ccaagctgaa 2700 

gttcatttgc agatcagcga ttaagtctga attgtgtggt ggtcactctt gtttcctttt 2760 

gtcaggggtt attgtaaatg ttagtaattt cgcctcaagc cctcagcaag aacataaata 2820 

ttttaaaata tgtgcgtttg aaatctgttt catgcatcct ggaactgtgg gatgctcagg 2880 

caagagtgac tttagtcttt cagtgaatgt tgcccagaat gtgggtaggg aaggctcaca 2940 

ggttactctc ctccttagag ctacaacata acattctgag gggagtcaca gggttgcctt 3000 

taaaaagtgg gagctatgtc atgcttcgag ctttctgtta agcacctctc ctaataaact 3060 

ctgaaaaaat aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3120 



3121 



<210> 73 

<211> 644 

<212> PRT 

<213> Homo sapiens 

<400> 73 

Met Ala Ile Ser Thr Gly Leu Phe Leu Leu Leu Gly Leu Leu Gly Gln 
1 5 10 15 

Pro Trp Ala Gly Ala Ala Ala Asp Ser Gln Ala Val Val Cys Glu Gly 
20 25 30 

Thr Ala Cys Tyr Thr Ala His Trp Gly Lys Leu Ser Ala Ala Glu Ala 
35 40 45 

Gln His Arg Cys Asn Glu Asn Gly Gly Asn Leu Ala Thr Val Lys Ser 
50 55 60 

Glu Glu Glu Ala Arg His Val Gln Gln Ala Leu Thr Gln Leu Leu Lys 
65 70 75 80 

Thr Lys Ala Pro Leu Glu Ala Lys Met Gly Lys Phe Trp Ile Gly Leu 
85 90 95 

Gln Arg Glu Lys Gly Asn Cys Thr Tyr His Asp Leu Pro Met Arg Gly 
100 105 110 

Phe Ser Trp Val Gly Gly Gly Glu Asp Thr Ala Tyr Ser Asn Trp Tyr 
115 120 125 

Lys Ala Ser Lys Ser Ser Cys Ile Phe Lys Arg Cys Val Ser Leu Ile 
130 135 140 

Leu Asp Leu Ser Leu Thr Pro His Pro Ser His Leu Pro Lys Trp His 
145 150 155 160 

Glu Ser Pro Cys Gly Thr Pro Glu Ala Pro Gly Asn Ser Ile Glu Gly 
165 170 175 

Phe Leu Cys Lys Phe Asn Phe Lys Gly Met Cys Arg Pro Leu Ala Leu 
180 185 190 

Gly Gly Pro Gly Arg Val Thr Tyr Thr Thr Pro Phe Gln Ala Thr Thr 
195 200 205 

Ser Ser Leu Glu Ala Val Pro Phe Ala Ser Val Ala Asn Val Ala Cys 
210 215 220 

Gly Asp Glu Ala Lys Ser Glu Thr His Tyr Phe Leu Cys Asn Glu Lys 
225 230 235 240 

Thr Pro Gly Ile Phe His Trp Gly Ser Ser Gly Pro Leu Cys Val Ser 
245 250 255 

Pro Lys Phe Gly Cys Ser Phe Asn Asn Gly Gly Cys Gln Gln Asp Cys 
260 265 270 

Phe Glu Gly Gly Asp Gly Ser Phe Arg Cys Gly Cys Arg Pro Gly Phe 
275 280 285 

Arg Leu Leu Asp Asp Leu Val Thr Cys Ala Ser Arg Asn Pro Cys Ser 
290 295 300 

Ser Asn Pro Cys Thr Gly Gly Gly Met Cys His Ser Val Pro Leu Ser 
305 310 315 320 

Glu Asn Tyr Thr Cys Arg Cys Pro Ser Gly Tyr Gln Leu Asp Ser Ser 
325 330 335 

Gln Val His Cys Val Asp Ile Asp Glu Cys Gln Asp Ser Pro Cys Ala 
340 345 350 



Gln Asp Cys Val Asn Thr Leu Gly Ser Phe His Cys Glu Cys Trp Val 
355 360 365 

Gly Tyr Gln Pro Ser Gly Pro Lys Glu Glu Ala Cys Glu Asp Val Asp 
370 375 380 

Glu Cys Ala Ala Ala Asn Ser Pro Cys Ala Gln Gly Cys Ile Asn Thr 
385 390 395 400 

Asp Gly Ser Phe Tyr Cys Ser Cys Lys Glu Gly Tyr Ile Val Ser Gly 
405 410 415 

Glu Asp Ser Thr Gln Cys Glu Asp Ile Asp Glu Cys Ser Asp Ala Arg 
420 425 430 

Gly Asn Pro Cys Asp Ser Leu Cys Phe Asn Thr Asp Gly Ser Phe Arg 
435 440 445 

Cys Gly Cys Pro Pro Gly Trp Glu Leu Ala Pro Asn Gly Val Phe Cys 
450 455 460 

Ser Arg Gly Thr Val Phe Ser Glu Leu Pro Ala Arg Pro Pro Gln Lys 
465 470 475 480 

Glu Asp Asn Asp Asp Arg Lys Glu Ser Thr Met Pro Pro Thr Glu Met 
485 490 495 

Pro Ser Ser Pro Ser Gly Ser Lys Asp Val Ser Asn Arg Ala Gln Thr 
500 505 510 

Thr Gly Leu Phe Val Gln Ser Asp Ile Pro Thr Ala Ser Val Pro Leu 
515 520 525 

Glu Ile Glu Ile Pro Ser Glu Val Ser Asp Val Trp Phe Glu Leu Gly 
530 535 540 

Thr Tyr Leu Pro Thr Thr Ser Gly His Ser Lys Pro Thr His Glu Asp 
545 550 555 560 

Ser Val Ser Ala His Ser Asp Thr Asp Gly Gln Asn Leu Leu Leu Phe 
565 570 575 

Tyr Ile Leu Gly Thr Val Val Ala Ile Ser Leu Leu Leu Val Leu Ala 
580 585 590 

Leu Gly Ile Leu Ile Tyr His Lys Arg Arg Ala Lys Lys Glu Glu Ile 
595 600 605 

Lys Glu Lys Lys Pro Gln Asn Ala Ala Asp Ser Tyr Ser Trp Val Pro 
610 615 620 

Glu Arg Ala Glu Ser Gln Ala Pro Glu Asn Gln Tyr Ser Pro Thr Pro 
625 630 635 640 

Gly Thr Asp Cys 



<210> 74 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 74 

tggtgtcgac gcagagtagc ggccgct 

<210> 75 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 75 
ggcccgggcc ggcc 

<210> 76 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 76 

tcgaggccgc ccgggcc 

<210> 77 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 77 

cagccctcac tccttctc 

<210> 78 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 78 

ggtggggtct ttcattcc 

<210> 79 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 79 

ttcagcaagc cctgactc 

<210> 80 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 80 

gccaccttcg aagcaatc 18 



<210> 81 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 81 

gagcggtaca ggagaatg 18 

<210> 82 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 82 

gcccacccaa ccaaatca 18 

<210> 83 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 83 

accgcgttct tctgtaac 18 

<210> 84 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 84 

cagctaacag caggatcc 18 

<210> 85 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 85 

gacccaggct gcccc 15 



<210> 86 
<211> 15 
<212> DNA 
<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 86 

ggtactcttg ttgag 15 



<210> 87 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 87 

gagggtacca agcrtcgtgg tccgcaacca gaag 34 

<210> 88 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 88 

ctcagatctg aattcggcct gtccctcggt gaggag 36 

<210> 89 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 89 

gacaagatct cagctgaata gcgacccacc ctgtg 35 

<210> 90 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 90 

gcatctagag cggccgctca ggcctgtccc tcggtgagga g 

<210> 91 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 91 

gcatctagag cggccgcgaa cgctgcttag atctcct 37 



<210> 92 
<211> 131 
<212> PRT 

<213> Mus musculus 
<400> 92 

Glu Val Phe Leu Asn Pro Leu Gly Arg Asp Leu Leu Ser Ile Ser Asp 
1 5 10 15 

Gly Arg Xaa Arg Ala Ile His Arg Arg Gly His Asn Asp Gly Glu Asp 
20 25 30 

Ser Leu Thr His Thr Asp Val Ser Ser Phe Gln Thr Met Asp Gln Met 
35 40 45 

Val Ser Asn Met Arg Asn Tyr Met Gln Lys Leu Glu Arg Asn Thr Gly 
50 55 60 

Gln Leu Ser Val Asp Pro Asn Gly His Ser Phe Cys Ser Ser Ser Val 
65 70 75 80 

Met Thr Tyr Ser Lys Ile Gly Asp Glu Pro Pro Lys Val Phe His Ala 
85 90 95 

Ser Thr Gln Thr Arg Arg Ala Pro Gly Arg Ile Lys Glu Thr Arg Lys 
100 105 110 

Ala Met Arg Asp Ser Asp Ser Gly Thr Arg Lys His Gly Tyr Trp Ser 
115 120 125 

Ser Tyr Pro 
130 



<210> 93 
<211> 131 
<212> PRT 

<213> Mus musculus 
<400> 93 

Arg Ser Phe Ser Glu Pro Phe Gly Arg Asp Leu Leu Ser Ile Ser Asp 
1 5 10 15 

Gly Arg Gly Arg Ala His Asn Arg Arg Gly His Asn Asp Gly Glu Asp 
20 25 30 

Ser Leu Thr His Thr Asp Val Ser Ser Phe Gln Thr Met Asp Gln Met 
35 40 45 

Val Ser Asn Met Arg Asn Tyr Met Gln Lys Leu Glu Arg Asn Phe Gly 
50 55 60 

Gln Leu Ser Val Asp Pro Asn Gly His Ser Phe Cys Ser Ser Ser Val 
65 70 75 80 

Met Thr Tyr Ser Lys Ile Gly Asp Glu Pro Pro Lys Val Phe Gln Ala 
85 90 95 

Ser Thr Gln Thr Arg Arg Ala Pro Gly Gly Ile Lys Glu Thr Arg Lys 
100 105 110 

Ala Met Arg Asp Ser Asp Ser Gly Leu Glu Lys Met Ala Ile Gly His 
115 120 125 

His Ile His 
130 



<210> 94 

<211> 159 

<212> PRT 

<213> Mus musculus 

<400> 94 

Ala Lys Tyr Val Cys Leu Ala Asn Lys Asn Cys Pro Val Asp Lys Arg 
1 5 10 15 

Arg Arg Asn Arg Cys Gln Tyr Cys Arg Phe Gln Lys Cys Leu Ala Val 
20 25 30 

Gly Met Val Lys Glu Xaa Val Arg Thr Asp Ser Leu Lys Gly Arg Arg 
35 40 45 

Gly Arg Leu Pro Ser Lys Pro Lys Ser Pro Gln Glu Pro Ser Pro Pro 
50 55 60 

Ser Pro Pro Val Ser Leu Ile Ser Ala Leu Val Arg Ala His Val Asp 
65 70 75 80 

Ser Asn Pro Ala Met Thr Ser Leu Asp Tyr Ser Arg Phe Gln Ala Asn 
85 90 95 

Pro Asp Tyr Gln Met Ser Gly Asp Asp Thr Gln His Ile Gln Gln Phe 
100 105 110 

Tyr Asp Leu Leu Thr Gly Ser Met Glu Ile Ile Arg Gly Trp Ala Xaa 
115 120 125 

Xaa Ile Pro Gly Phe Ala Asp Leu Pro Lys Ala Asp Gln Asp Leu Leu 
130 135 140 

Phe Glu Ser Ala Phe Leu Glu Leu Phe Val Leu Arg Leu Ala Tyr 
145 150 155 



<210> 95 

<211> 140

<212> PRT 

<213> Mus musculus 

<400> 95 

Ser Gln Thr Arg Gln Gln Gly Pro Leu Arg Ser Ile Met Lys Asp Leu
1 5 10 15

His Ser Asp Asp Asn Glu Glu Glu Ser Asp Glu Val Glu Asp Asn Asp
20 25 30

Asn Asp Ser Glu Met Glu Arg Pro Val Asn Arg Gly Gly Ser Arg Ser
35 40 45

Arg Arg Val Ser Leu Ser Asp Gly Ser Asp Ser Glu Ser Ser Ser Ala
50 55 60

Ser Ser Pro Leu His His Glu Pro Pro Pro Pro Leu Leu Lys Thr Asn
65 70 75 80

Asn Asn Gln Ile Leu Glu Val Lys Ser Pro Ile Lys Gln Ser Lys Ser
85 90 95

Asp Lys Gln Ile Lys Asn Gly Glu Cys Asp Lys Ala Tyr Leu . Glu
100 105 110

Leu Val Glu Leu His Arg Arg Leu Met Thr Leu Arg Glu Arg His Ile
115 120 125

Leu Gln Gln Ile Val Asn Leu Ile Glu Glu Thr Gly
130 135 140 



<210> 96 
<211> 652 
<212> PRT 

<213> Mus musculus 
<400> 96 

Met Ala Thr Ser Met Gly Leu Leu Leu Leu Leu Leu Leu Leu Leu Thr 
1 5 10 15

Gln Pro Gly Ala Gly Thr Gly Ala Asp Thr Glu Ala Val Val Cys Val
20 25 30

Gly Thr Ala Cys Tyr Thr Ala His Ser Gly Lys Leu Ser Ala Ala Glu
35 40 45

Ala Gln Asn His Cys Asn Gln Asn Gly Gly Asn Leu Ala Thr Val Lys
50 55 60

Ser Lys Glu Glu Ala Gln His Val Gln Arg Val Leu Ala Gln Leu Leu
65 70 75 80

Arg Arg Glu Ala Ala Leu Thr Ala Arg Met Ser Lys Phe Trp Ile Gly
85 90 95

Leu Gln Arg Glu Lys Gly Lys Cys Leu Asp Pro Ser Leu Pro Leu Lys
100 105 110

Gly Phe Ser Trp Val Gly Gly Gly Glu Asp Thr Pro Tyr Ser Asn Trp
115 120 125

His Lys Glu Leu Arg Asn Ser Cys Ile Ser Lys Arg Cys Val Ser Leu
130 135 140

Leu Leu Asp Leu Ser Gln Pro Leu Leu Pro Asn Arg Leu Pro Lys Trp
145 150 155 160

Ser Glu Gly Pro Cys Gly Ser Pro Gly Ser Pro Gly Ser Asn Ile Glu
165 170 175

Gly Phe Val Cys Lys Phe Ser Phe Lys Gly Met Cys Arg Pro Leu Ala
180 185 190

Leu Gly Gly Pro Gly Gln Val Thr Tyr Thr Thr Pro Phe Gln Thr Thr
195 200 205

Ser Ser Ser Leu Glu Ala Val Pro Phe Ala Ser Ala Ala Asn Val Ala
210 215 220

Cys Gly Glu Gly Asp Lys Asp Glu Thr Gln Ser His Tyr Phe Leu Cys
225 230 235 240 

Lys Glu Lys Ala Pro Asp Val Phe Asp Trp Gly Ser Ser Gly Pro Leu 
245 250 255 

Cys Val Ser Pro Tyr Gly Cys Asn Phe Asn Asn Gly Gly Val His
260 265 270

Gln Asp Cys Phe Glu Gly Gly Asp Gly Ser Phe Leu Cys Gly Cys Arg
275 280 285 

Pro Gly Arg Arg Leu Leu Asp Asp Leu Val Thr Cys Ala Ser Arg Asn 
290 295 300 

Pro Cys Ser Ser Ser Pro Cys Arg Gly Gly Ala Thr Cys Val Leu Gly 
305 310 315 320 

Pro His Gly Lys Asn Tyr Thr Cys Arg Cys Pro Gln Gly Tyr Gln Leu
325 330 335

Asp Ser Ser Gln Leu Asp Cys Val Asp Val Asp Glu Cys Gln Asp Ser
340 345 350

Pro Cys Ala Gln Glu Cys Val Asn Thr Pro Gly Gly Phe Arg Cys Glu
355 360 365

Cys Trp Val Gly Tyr Glu Pro Gly Gly Pro Gly Glu Gly Ala Cys Gln
370 375 380

Asp Val Asp Glu Cys Ala Leu Gly Arg Ser Pro Cys Ala Gln Gly Cys
385 390 395 400

Thr Asn Thr Asp Gly Ser Phe His Cys Ser Cys Glu Glu Gly Tyr Val
405 410 415

Leu Ala Gly Glu Asp Gly Thr Gln Cys Gln Asp Val Asp Glu Cys Val
420 425 430

Gly Pro Gly Gly Pro Leu Cys Asp Ser Leu Cys Phe Asn Thr Gln Gly
435 440 445 

Ser Phe His Cys Gly Cys Leu Pro Gly Trp Val Leu Ala Pro Asn Gly 
450 455 460 

Val Ser Cys Thr Met Gly Pro Val Ser Leu Gly Pro Pro Ser Gly Pro 
465 470 475 480 

Pro Asp Glu Glu Asp Lys Gly Glu Lys Glu Gly Ser Thr Val Pro Arg 
485 490 495 

Ala Ala Thr Ala Ser Pro Thr Arg Gly Pro Glu Gly Thr Pro Lys Ala 
500 505 510 

Thr Pro Thr Thr Ser Arg Pro Ser Leu Ser Ser Asp Ala Pro Ile Thr
515 520 525 

Ser Ala Pro Leu Lys Met Leu Ala Pro Ser Gly Ser Ser Gly Val Trp 
530 535 540 

Arg Glu Pro Ser Ile His His Ala Thr Ala Ala Ser Gly Pro Gln Glu
545 550 555 560

Pro Ala Gly Gly Asp Ser Ser Val Ala Thr Gln Asn Asn Asp Gly Thr
565 570 575

Asp Gly Gln Lys Leu Leu Leu Phe Tyr Ile Leu Gly Thr Val Val Ala
580 585 590

Ile Leu Leu Leu Leu Ala Leu Ala Leu Gly Leu Leu Val Tyr Arg Lys
595 600 605

Arg Arg Ala Lys Arg Glu Glu Lys Lys Glu Lys Lys Pro Gln Asn Ala
610 615 620 

Ala Asp Ser Tyr Ser Trp Val Pro Glu Arg Ala Glu Ser Arg Ala Met 
625 630 635 640 

Glu Asn Gln Tyr Ser Pro Thr Pro Gly Thr Asp Cys
645 650 
Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. [X] Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely:

Remark: Although claims 46-49 are directed to a method of treatment of the human/animal
body, the search has been carried out and based on the alleged
effects of the compound/composition.



2. [X] Claims Nos.:

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210



3. [ ] Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheets 



1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all
searchable claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.



3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report
covers only those claims for which fees were paid, specifically claims Nos.:



4. [X] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

claims (1-5,7-20,22-31,42-50) - partial



Remark on Protest [ ] The additional search fees were accompanied by the applicant's protest.

[ ] No protest accompanied the payment of additional search fees.



Form PCT/ISA/210 (continuation of first sheet (1)) (July 1998)
FURTHER INFORMATION CONTINUED FROM PCT/ISA/210



Continuation of Box I.2

Claims 31,42,43,46,50 refer to modulators of the polypeptides without 
giving a true technical characterization. Moreover, no such compounds 
are defined in the application. In consequence, the scope of said claims 
is ambiguous and vague, and their subject-matter is not sufficiently 
disclosed and supported (Art. 5 and 6 PCT). No search can be performed
for such purely speculative claims whose wording is, in fact, a mere 
recitation of the results to be achieved. 

Claims 18-19 are unclear with regard to their dependence from claim 16 
and to their scope. 

Claims 2,10,44 are inconsistent, in that Seq. ID 42 defines an amino acid 
sequence and not a nucleic acid. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
FURTHER INFORMATION CONTINUED FROM PCT/ISA/210

1. Claims: (1-5,7-20,22-31,42-50) - partial 

An isolated nucleic acid derived from an isolated 
hematopoietic stem cell, expressed in and specific to a 
hematopoietic stem cell. 

Said nucleic acid capable of hybridizing to seq. ID 1. 

Corresponding vectors, host systems, polypeptides, 
antibodies and antibody-producing cells, detection methods,
modulators and methods to identify them, therapeutic and
pharmaceutical applications.

2. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 1. 

3. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 3 

4. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 4 

5. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 5 

6. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 6 

7. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 7 

8. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 8 

9. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 9 

10. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 10

11. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 11 

12. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 12 

13. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 13 

14. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 14 

15. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 15 

16. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 16 

17. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 17 

18. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 18 

19. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 19 

20. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 20 

21. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 21

22. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 22 

23. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 23 

24. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 24 

25. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 25 

26. Claims: (1-5,7-20,22-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 26 

27. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 27 

28. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 28 

29. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 29 

30. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 30 

31. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 31 



32. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 32

33. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 33 

34. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 34 

35. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 35 

36. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 36 

37. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 37 

38. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 38 

39. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 39 

40. Claims: (1-5,7-20,22-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 40 

41. Claims: (1-5,7-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 41 (and for 
the polypeptide, seq. ID 42) 

42. Claims: (1-5,7-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 43 (and for 
the polypeptide, seq. ID 44) 

43. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 45 (and for
the polypeptide, seq. ID 46)

44. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 47 (and for
the polypeptide, seq. ID 48)

45. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 49 (and for
the polypeptide, seq. ID 50)

46. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 51 (and for
the polypeptide, seq. ID 52)

47. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 53 (and for
the polypeptide, seq. ID 54)

48. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 55 (and for
the polypeptide, seq. ID 56)

49. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 57 (and for
the polypeptide, seq. ID 58)

50. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 59 (and for
the polypeptide, seq. ID 60)

51. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 61 (and for
the polypeptide, seq. ID 62)

52. Claims: (1-5,7-31,42-50) - partial

Idem as subject matter 1, but limited to seq. ID 63 (and for 
the polypeptide, seq. ID 64) 

53. Claims: (1-5,7-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 65 (and for 
the polypeptide, seq. ID 66) 

54. Claims: (1-5,7-31,42-50) - partial 

Idem as subject matter 1, but limited to seq. ID 67 (and for 
the polypeptide, seq. ID 68) 

55. Claims: (17,21) - partial 

An isolated hematopoietic stem cell-specific protein or
portion thereof. 

Said protein characterized by seq. ID 70 

56. Claims: (17,21) - partial 

Idem as subject matter 55, but limited to seq. ID 71

57. Claims: (1-5,7-20,22-31,42-50) - partial; (6) - complete 

Idem as subject matter 1, but limited to seq. ID 72 (and for 
the polypeptide, seq. ID 73) 

58. Claims: (32-41) - complete 

A method for identifying primitive hematopoietic stem 
cell-specific nucleic acids. 